DNA RECOMBINATION IN EUKARYOTIC CELLS BY THE
BACTERIOPHAGE PHIC31 RECOMBINATION SYSTEM

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

This invention was made with government support under Grant No.
5335-21000-009-06S, awarded by the United States Department of Agriculture,
Agricultural Research Service. The Government has certain rights in the invention.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims the benefit of US Provisional Application No.
60/145,469, filed July 23, 1999, which application is incorporated herein by reference.

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention pertains to the field of methods for obtaining specific and
stable integration of nucleic acids into chromosomes of eukaryotes. The invention makes use
of site-specific recombination systems that use prokaryotic recombinase polypeptides, such
as the ΦC31 integrase.

Background 

Genetic transformation of eukaryotes often suffers from significant
shortcomings. For example, it is often difficult to reproducibly obtain integration of a
transgene at a particular locus of interest. Homologous recombination generally occurs only
at a very low frequency. To overcome this problem, site-specific recombination systems
that can operate in higher eukaryotic cells have been employed.

Many bacteriophage and integrative plasmids encode site-specific
recombination systems that enable the stable incorporation of their genome into those of
their hosts. In these systems, the minimal requirements for the recombination reaction are a
recombinase enzyme, or integrase, which catalyzes the recombination event, and two
recombination sites (Sadowski (1986) J. Bacteriol. 165: 341-347; Sadowski (1993) FASEB
J. 7: 760-767). For phage integration systems, these are referred to as attachment (att) sites,
with an attP element from phage DNA and the attB element encoded by the bacterial
genome. The two attachment sites can share as little sequence identity as a few base pairs.
The recombinase protein binds to both att sites and catalyzes a conservative and reciprocal
exchange of DNA strands that results in integration of the circular phage or plasmid DNA
into host DNA. Additional phage or host factors, such as the DNA bending protein IHF
(integration host factor), may be required for an efficient reaction (Friedman (1988) Cell
55: 545-554; Finkel & Johnson (1992) Mol. Microbiol. 6: 3257-3265). The reverse excision
reaction sometimes requires an additional phage factor, such as the xis gene product of phage
λ (Weisberg & Landy (1983) "Site-specific recombination in phage lambda." In Lambda II,
eds. Hendrix et al. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) pp. 211-250;
Landy (1989) Ann. Rev. Biochem. 58: 913-949).

The recombinases have been categorized into two groups, the λ integrase
(Argos et al. (1986) EMBO J. 5: 433-44; Voziyanov et al. (1999) Nucl. Acids Res. 27: 930-941)
and the resolvase/invertase (Hatfull & Grindley (1988) "Resolvases and DNA-invertases:
a family of enzymes active in site-specific recombination" In Genetic
Recombination, eds. Kucherlapati, R., & Smith, G. R. (Am. Soc. Microbiol., Washington
DC), pp. 357-396) families. These vary in the structure of the integrase enzymes and the
molecular details of their mode of catalysis (Stark et al. (1992) Trends Genetics 8: 432-439).
The temperate Streptomyces phage ΦC31 encodes a 68 kD recombinase of the latter class.
The efficacy of the ΦC31 integrase enzyme in recombining its cognate attachment sites was
recently demonstrated in vitro and in vivo in recA mutant Escherichia coli (Thorpe & Smith
(1998) Proc. Natl. Acad. Sci. USA 95: 5505-5510). The ΦC31 integration reaction is simple
in that it does not require a host factor and appears irreversible, most likely because an
additional phage protein is required for excision. The phage and bacterial att sites share only
three base pairs of homology at the point of cross-over. This homology is flanked by
inverted repeats, presumably binding sites for the integrase protein. The minimal known
functional size for both attB and attP is ~50 bp.

The Cre-lox system of bacteriophage P1, and the FLP-FRT system of
Saccharomyces cerevisiae have been widely used for transgene and chromosome
engineering in animals and plants (reviewed by Sauer (1994) Curr. Opin. Biotechnol. 5: 521-527;
Ow (1996) Curr. Opin. Biotechnol. 7: 181-186). Other systems that operate in animal
or plant cells include the following: 1) the R-RS system from Zygosaccharomyces rouxii
(Onouchi et al. (1995) Mol. Gen. Genet. 247: 653-660), 2) the Gin-gix system from
bacteriophage Mu (Maeser & Kahmann (1991) Mol. Gen. Genet. 230: 170-176), and 3) the β
recombinase-six system from bacterial plasmid pSM19035 (Diaz et al. (1999) J. Biol. Chem.
274: 6634-6640). By using the site-specific recombinases, one can obtain a greater frequency
of integration.

However, these five systems suffer from a significant shortcoming. Each of
these systems has in common the property that a single polypeptide recombinase catalyzes
the recombination between two sites of identical or nearly identical sequences. The product
sites generated by recombination are themselves substrates for subsequent recombination.
Consequently, recombination reactions are readily reversible. Since the kinetics of
intramolecular interactions are favored over intermolecular interactions, these recombination
systems are efficient for deleting rather than integrating DNA. Thus, a need exists for
methods and systems for obtaining stable site-specific integration of transgenes. The present
invention fulfills this and other needs.

SUMMARY OF THE INVENTION

The present invention provides methods for obtaining stable, site-specific
recombination in a eukaryotic cell. Unlike previously known methods for site-specific
recombination, the recombinants obtained using the methods of the invention are stable. The
recombination reaction is not reversible.

The methods involve providing a eukaryotic cell that comprises a first
recombination site and a second recombination site, which second recombination site can
serve as a substrate for recombination with the first recombination site. The first and the
second recombination sites are contacted with a prokaryotic recombinase polypeptide,
resulting in recombination between the recombination sites, thereby forming one or two
hybrid recombination sites. Significantly, the recombinase polypeptide is one that can
mediate site-specific recombination between the first and second recombination sites, but
cannot mediate recombination between the two hybrid recombination sites in the absence of
an additional phage-produced factor that is not present in the eukaryotic cell. Either or both
of the recombination sites can be present in a chromosome of the eukaryotic cell. In some
embodiments, one of the recombination sites is present in the chromosome and the other is
included within a nucleic acid that is to be integrated into the chromosome.

The invention also provides eukaryotic cells that contain a prokaryotic
recombinase polypeptide or a nucleic acid that encodes a prokaryotic recombinase. In these
embodiments, the recombinase is one that can mediate site-specific recombination between a
first recombination site and a second recombination site that can serve as a substrate for
recombination with the first recombination site, but in the absence of an additional factor
that is not present in the eukaryotic cell cannot mediate recombination between two hybrid
recombination sites that are formed upon recombination between the first recombination site
and the second recombination site. In presently preferred embodiments, the cells of the
invention include a nucleic acid that has a coding sequence for a recombinase polypeptide.
The recombinase coding sequence is preferably operably linked to a promoter that mediates
expression of the recombinase-encoding polynucleotide in the eukaryotic cell. The
eukaryotic cells of the invention can be an animal cell, a plant cell, a yeast cell or a fungal
cell, for example.

In additional embodiments, the invention provides methods for obtaining a
eukaryotic cell having a stably integrated transgene. These methods involve introducing a
nucleic acid into a eukaryotic cell that comprises a first recombination site, wherein the
nucleic acid comprises the transgene of interest and a second recombination site which can
serve as a substrate for recombination with the first recombination site. The first and second
recombination sites are contacted with a prokaryotic recombinase polypeptide. The
recombinase polypeptide catalyzes recombination between the first and second
recombination sites, resulting in integration of the nucleic acid at the first recombination site,
thereby forming a hybrid recombination site at each end of the nucleic acid. Again, the
recombinase polypeptide is one that can mediate site-specific recombination between the
first and second recombination sites, but cannot mediate recombination between two hybrid
recombination sites in the absence of an additional factor that is not present in the eukaryotic
cell.

Additional embodiments of the invention provide nucleic acids that include a
polynucleotide sequence that encodes a bacterial recombinase polypeptide operably linked to
a promoter that functions in a eukaryotic cell. The recombinase polypeptides encoded by
these nucleic acids of the invention cannot mediate recombination between two hybrid
recombination sites that are formed upon recombination between a first recombination site
and a second recombination site in the absence of a bacteriophage factor that is not present in
the eukaryotic cells. In some embodiments, the nucleic acids further include at least one
recombination site that is recognized by the recombinase polypeptide.

Also provided by the invention are eukaryotic cells that include a
polynucleotide that has one or more bacteriophage ΦC31 recombination sites, or
recombination sites for other recombinases that cannot mediate recombination between two
hybrid recombination sites that are formed upon recombination between a first
recombination site and a second recombination site in the absence of a bacteriophage factor
that is not present in the eukaryotic cells.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic (not to scale) representation of the chromosome
structure at the S. pombe leu1 locus. Homologous insertion of pLT44 into the chromosome
(Figure 1A) places a ΦC31 attP target between leu1 alleles as shown in Figure 1B.
pLT43-promoted site-specific integration of pLT45 into the chromosomal attP target leads to the
structure shown in Figure 1C. Arrowheads indicate PCR primers corresponding to the T7
promoter (T7), T3 promoter (T3) and ura4+ coding region (U4). Predicted sizes of (X)
cleavage products are shown.

Figure 2 shows a schematic of an experiment which demonstrated that ΦC31
integrase catalyzes site-specific integration of a transgene encoding green fluorescent protein
(GFP) in CHO cells.

Figure 3 shows a schematic diagram of an experiment which demonstrated
that ΦC31 catalyzes specific recombination at an attB site to insert a hygromycin
phosphotransferase gene downstream of a chromosomally located promoter. Successful
integration produces a Pc-attL-hpt linkage and a hygromycin resistance phenotype. The
effect of different lengths of attP and attB sites was analyzed using the plasmids indicated.

Figure 4 shows a schematic diagram of an experiment which demonstrates
that ΦC31 integrase catalyzes the excision of a DNA flanked by attB and attP sites from the
tobacco genome.

Figure 5 shows a schematic diagram of an experiment in which ΦC31
integrase was shown to catalyze integration of a transgene into the tobacco genome.

DETAILED DESCRIPTION

Definitions 

An "exogenous DNA segment", "heterologous polynucleotide", a "transgene"
or a "heterologous nucleic acid", as used herein, is one that originates from a source foreign
to the particular host cell, or, if from the same source, is modified from its original form.
Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular
host cell, but has been modified. Thus, the terms refer to a DNA segment which is foreign or
heterologous to the cell, or homologous to the cell but in a position within the host cell
nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are
expressed to yield exogenous polypeptides.

The term "gene" is used broadly to refer to any segment of DNA associated
with a biological function. Thus, genes include coding sequences and/or the regulatory
sequences required for their expression. Genes can also include nonexpressed DNA
segments that, for example, form recognition sequences for other proteins. Genes can be
obtained from a variety of sources, including cloning from a source of interest or
synthesizing from known or predicted sequence information, and may include sequences
designed to have desired parameters.

The term "isolated", when applied to a nucleic acid or protein, denotes that
the nucleic acid or protein is essentially free of other cellular components with which it is
associated in the natural state. It is preferably in a homogeneous state although it can be in
either a dry or aqueous solution. Purity and homogeneity are typically determined using
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high
performance liquid chromatography. A protein which is the predominant species present in a
preparation is substantially purified. In particular, an isolated gene is separated from open
reading frames which flank the gene and encode a protein other than the gene of interest.
The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band
in an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least
about 50% pure, more preferably at least about 85% pure, and most preferably at least about
99% pure.



The term "naturally-occurring" is used to describe an object that can be found
in nature as distinct from being artificially produced by man. For example, a polypeptide or
polynucleotide sequence that is present in an organism (including viruses) that can be
isolated from a source in nature and which has not been intentionally modified by man in the
laboratory is naturally-occurring.

The term "nucleic acid" or "polynucleotide" refers to deoxyribonucleotides
or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless
specifically limited, the term encompasses nucleic acids containing known analogues of
natural nucleotides which have similar binding properties as the reference nucleic acid and
are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise
indicated, a particular nucleic acid sequence also implicitly encompasses conservatively
modified variants thereof (e.g., degenerate codon substitutions) and complementary
sequences as well as the sequence explicitly indicated. Specifically, degenerate codon
substitutions may be achieved by generating sequences in which the third position of one or
more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues
(Batzer et al. (1991) Nucleic Acid Res. 19: 5081; Ohtsuka et al. (1985) J. Biol. Chem. 260:
2605-2608; Cassol et al. (1992); Rossolini et al. (1994) Mol. Cell. Probes 8: 91-98). The
term nucleic acid is used interchangeably with gene, cDNA, and mRNA encoded by a gene.

"Nucleic acid derived from a gene" refers to a nucleic acid for whose
synthesis a gene, or a subsequence thereof (e.g., coding region), has ultimately served as a
template. Thus, an mRNA, a cDNA reverse transcribed from an mRNA, an RNA transcribed
from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified
DNA, etc., are all derived from the gene and detection of such derived products is indicative
of the presence and/or abundance of the original.

A DNA segment is "operably linked" when placed into a functional
relationship with another DNA segment. For example, DNA for a signal sequence is
operably linked to DNA encoding a polypeptide if it is expressed as a preprotein that
participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to
a coding sequence if it stimulates the transcription of the sequence. Generally, DNA
sequences that are operably linked are contiguous, and in the case of a signal sequence both
contiguous and in reading phase. However, enhancers, for example, need not be contiguous
with the coding sequences whose transcription they control. Linking is accomplished by
ligation at convenient restriction sites or at adapters or linkers inserted in lieu thereof.

"Plant" includes whole plants, plant organs (e.g., leaves, stems, roots, etc.),
seeds and plant cells and progeny of same. The class of plants that can be used in the
methods of the invention is generally as broad as the class of higher plants amenable to
transformation techniques, including both monocotyledonous and dicotyledonous plants.

"Promoter" refers to a region of DNA involved in binding the RNA
polymerase to initiate transcription. An "inducible promoter" refers to a promoter that directs
expression of a gene where the level of expression is alterable by environmental or
developmental factors such as, for example, temperature, pH, transcription factors and
chemicals.

The term "recombinant" when used with reference to a cell indicates that the
cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a
heterologous nucleic acid. Recombinant cells can contain polynucleotides that are not found
within the native (non-recombinant) form of the cell. Recombinant cells can also contain
polynucleotides found in the native form of the cell wherein the polynucleotides are
modified and re-introduced into the cell by artificial means. The term also encompasses cells
that contain a nucleic acid endogenous to the cell that has been modified without removing
the nucleic acid from the cell; such modifications include those obtained by gene
replacement, site-specific mutation, and related techniques.

A "recombinant expression cassette" or simply an "expression cassette" is a
nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements
that are capable of effecting expression of a structural gene in hosts compatible with such
sequences. Expression cassettes include at least promoters and optionally, transcription
termination signals. Typically, the recombinant expression cassette includes a nucleic acid
to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a promoter.
Additional factors necessary or helpful in effecting expression may also be used as described
herein. For example, an expression cassette can also include nucleotide sequences that
encode a signal sequence that directs secretion of an expressed protein from the host cell.
Transcription termination signals, enhancers, and other nucleic acid sequences that influence
gene expression, can also be included in an expression cassette.

"Recombinase" refers to an enzyme that catalyzes recombination between
two or more recombination sites. Recombinases useful in the present invention catalyze
recombination at specific recombination sites which are specific polynucleotide sequences
that are recognized by a particular recombinase. The term "integrase" refers to a type of
recombinase.

"Transformation rate" refers to the percent of cells that successfully
incorporate a heterologous polynucleotide into their genomes and survive.

The term "transgenic" refers to a cell that includes a specific modification that
was introduced into the cell, or into an ancestor of the cell. Such modifications can include
one or more point mutations, deletions, insertions, or combinations thereof. When referring
to an animal, the term "transgenic" means that the animal includes cells that are transgenic.
An animal that is composed of both transgenic and non-transgenic cells is referred to herein
as a "chimeric" animal.

The term "vector" refers to a composition for transferring a nucleic acid (or
nucleic acids) to a host cell. A vector comprises a nucleic acid encoding the nucleic acid to
be transferred, and optionally comprises a viral capsid or other materials for facilitating entry
of the nucleic acid into the host cell and/or replication of the vector in the host cell (e.g.,
reverse transcriptase or other enzymes which are packaged within the capsid, or as part of
the capsid).

"Recombination sites" are specific polynucleotide sequences that are
recognized by the recombinase enzymes described herein. Typically, two different sites are
involved (termed "complementary sites"), one present in the target nucleic acid (e.g., a
chromosome or episome of a eukaryote) and another on the nucleic acid that is to be
integrated at the target recombination site. The terms "attB" and "attP," which refer to
attachment (or recombination) sites originally from a bacterial target and a phage donor,
respectively, are used herein although recombination sites for particular enzymes may have
different names. The recombination sites typically include left and right arms separated by a
core or spacer region. Thus, an attB recombination site consists of BOB', where B and B' are
the left and right arms, respectively, and O is the core region. Similarly, attP is POP', where
P and P' are the arms and O is again the core region. Upon recombination between the attB
and attP sites, and concomitant integration of a nucleic acid at the target, the recombination
sites that flank the integrated DNA are referred to as "attL" and "attR." The attL and attR
sites, using the terminology above, thus consist of BOP' and POB', respectively. In some
representations herein, the "O" is omitted and attB and attP, for example, are designated as
BB' and PP', respectively.
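
The arm notation above can be illustrated with a short Python sketch. It is only a bookkeeping aid, not part of the described methods: the `Site` type and the `recombine` function are hypothetical names introduced here, and the arms are treated as plain labels rather than real sequences.

```python
from typing import NamedTuple, Tuple


class Site(NamedTuple):
    """A recombination site written as left arm, core, right arm."""
    left: str
    core: str
    right: str

    def label(self) -> str:
        return self.left + self.core + self.right


def recombine(att_b: Site, att_p: Site) -> Tuple[Site, Site]:
    """Exchange arms at the shared core: BOB' x POP' gives BOP' and POB'."""
    if att_b.core != att_p.core:
        raise ValueError("both sites must share the same core region")
    att_l = Site(att_b.left, att_b.core, att_p.right)  # BOP'
    att_r = Site(att_p.left, att_p.core, att_b.right)  # POB'
    return att_l, att_r


att_b = Site("B", "O", "B'")
att_p = Site("P", "O", "P'")
att_l, att_r = recombine(att_b, att_p)
print(att_l.label(), att_r.label())
```

Each hybrid site carries one arm from attB and one from attP, which is why neither product is identical to either starting site.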

Description of the Preferred Embodiments 



The present invention provides methods for obtaining site-specific
recombination in eukaryotic cells. Unlike previously known systems for obtaining
site-specific recombination, the products of the recombinations performed using the methods of
the invention are stable. Thus, one can use the methods to, for example, introduce transgenes
into chromosomes of eukaryotic cells and avoid the excision of the transgene that often
occurs using previously known site-specific recombination systems. Stable inversions,
translocations, and other rearrangements can also be obtained.

The invention employs prokaryotic recombinases, such as bacteriophage
integrases, that are unidirectional in that they can catalyze recombination between two
complementary recombination sites, but cannot catalyze recombination between the hybrid
sites that are formed by this recombination. One such recombinase, the ΦC31 integrase, by
itself catalyzes only the attB x attP reaction. The integrase cannot mediate recombination
between the attL and attR sites that are formed upon recombination between attB and attP.
Because recombinases such as the ΦC31 integrase cannot alone catalyze the reverse
reaction, the ΦC31 attB x attP recombination is stable. This property is one that sets the
methods of the present invention apart from site-specific recombination systems currently in
use for eukaryotic cells, such as the Cre-lox or FLP-FRT system, where the recombination
reactions can readily reverse. Use of the recombination systems of the invention provides
new opportunities for directing stable transgene and chromosome rearrangements in
eukaryotic cells.
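
The contrast between a unidirectional and a reversible system can be sketched as a substrate lookup in Python. This is a toy model under stated assumptions: the table layout and the names `SUBSTRATES`, `PRODUCTS`, `react`, and `is_reversible` are hypothetical, and only the pairings themselves (attB x attP giving attL and attR; identical lox sites giving lox sites again) come from the text.

```python
from typing import Optional, Tuple

# Site pairs each enzyme accepts when acting alone (illustrative table).
SUBSTRATES = {
    "phiC31 integrase": {frozenset(["attB", "attP"])},
    "Cre": {frozenset(["lox"])},
}

# Product sites formed from each substrate pair.
PRODUCTS = {
    frozenset(["attB", "attP"]): ("attL", "attR"),
    frozenset(["lox"]): ("lox", "lox"),
}


def react(enzyme: str, site1: str, site2: str) -> Optional[Tuple[str, str]]:
    """Return the product sites, or None if the pair is not a substrate."""
    pair = frozenset([site1, site2])
    if pair not in SUBSTRATES[enzyme]:
        return None
    return PRODUCTS[pair]


def is_reversible(enzyme: str, site1: str, site2: str) -> bool:
    """A reaction is reversible if its own products are again substrates."""
    products = react(enzyme, site1, site2)
    if products is None:
        return False
    return react(enzyme, *products) is not None


print(is_reversible("phiC31 integrase", "attB", "attP"))
print(is_reversible("Cre", "lox", "lox"))
```

In this model the integrase reaction stops after one step because attL and attR are not in its substrate table, whereas the Cre products can react again.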

The methods involve contacting a pair of recombination sites (e.g., attB and
attP) that are present in a eukaryotic cell with a corresponding recombinase. The
recombinase then mediates recombination between the recombination sites. Depending upon
the relative locations of the two recombination sites, any one of a number of events can
occur as a result of the recombination. For example, if the two recombination sites are
present on different nucleic acid molecules, the recombination can result in integration of
one nucleic acid molecule into a second molecule. Thus, one can obtain integration of a
plasmid that contains one recombination site into a eukaryotic cell chromosome that includes
the corresponding recombination site. Because the recombinases used in the methods of the
invention cannot catalyze the reverse reaction, the integration is stable. Such methods are
useful, for example, for obtaining stable integration into the eukaryotic chromosome of a
transgene that is present on the plasmid.
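
As a rough illustration of the integration event described above, the following Python sketch treats a chromosome and a circular plasmid as ordered lists of named elements. The function name `integrate`, the element names, and the choice to place attB on the chromosome and attP on the plasmid are assumptions made for the example; the text allows either arrangement.

```python
from typing import List


def integrate(chromosome: List[str], plasmid: List[str]) -> List[str]:
    """Insert a circular plasmid carrying attP at a chromosomal attB."""
    if "attB" not in chromosome or "attP" not in plasmid:
        raise ValueError("need attB on the chromosome and attP on the plasmid")
    i = chromosome.index("attB")
    j = plasmid.index("attP")
    # Open the circle at attP: read on from just after attP, wrapping around.
    linearized = plasmid[j + 1:] + plasmid[:j]
    # The inserted DNA ends up flanked by the two hybrid sites.
    return chromosome[:i] + ["attL"] + linearized + ["attR"] + chromosome[i + 1:]


chromosome = ["geneA", "attB", "geneC"]
plasmid = ["marker", "attP", "transgene"]  # circular; the list start is arbitrary
integrated = integrate(chromosome, plasmid)
print(integrated)
```

Calling `integrate` a second time on the result raises `ValueError`, because the product contains only attL and attR and no longer offers an attB target.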

The two recombination sites can also be present on the same nucleic acid
molecule. In such cases, the resulting product typically depends upon the relative orientation
of the sites. For example, recombination between sites that are in the direct orientation will
generally result in excision of any DNA that lies between the two recombination sites. In
contrast, recombination between sites that are in the reverse orientation can result in
inversion of the intervening DNA. Again, the resulting rearranged nucleic acid is stable in
that the recombination is irreversible in the absence of an additional factor, generally
encoded by the particular bacteriophage from which the recombinase is derived, that is not
normally found in eukaryotic cells. One example of an application for which this method is
useful involves the placement of a promoter between the two recombination sites. If the
promoter is initially in the opposite orientation relative to a coding sequence that is to be
expressed by the promoter and the recombination sites that flank the promoter are in the
inverted orientation, contacting the recombination sites with the recombinase will result in
inversion of the promoter, thus placing the promoter in the correct orientation to drive
expression of the coding sequence. Similarly, if the promoter is initially in the correct
orientation for expression and the recombination sites are in the same orientation, contacting the
recombination sites with the recombinase can result in excision of the promoter fragment, thus
stopping expression of the coding sequence.
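
The promoter switch described above can be sketched in Python by tracking each element together with its strand. This is a simplified model, not a description of the actual constructs: the `rearrange` and `flip` functions, the generic "hybrid site" label, and the example element lists are all hypothetical.

```python
from typing import List, Tuple

Element = Tuple[str, str]  # (name, strand), where strand is "+" or "-"


def flip(element: Element) -> Element:
    """Reverse the strand of one element."""
    name, strand = element
    return (name, "-" if strand == "+" else "+")


def rearrange(dna: List[Element], i: int, j: int,
              orientation: str) -> Tuple[List[Element], List[Element]]:
    """Recombine the sites at indices i < j of a single molecule.

    Returns (parent molecule, excised circle); the circle is empty
    when the outcome is an inversion.
    """
    if not 0 <= i < j < len(dna):
        raise ValueError("expected site indices with i < j")
    left, between, right = dna[:i], dna[i + 1:j], dna[j + 1:]
    hybrid = ("hybrid site", "+")
    if orientation == "direct":
        # Excision: the intervening DNA leaves with one hybrid site.
        return left + [hybrid] + right, between + [hybrid]
    if orientation == "inverted":
        # Inversion: the intervening DNA is reversed and strand-flipped.
        inverted = [flip(e) for e in reversed(between)]
        return left + [hybrid] + inverted + [hybrid] + right, []
    raise ValueError("orientation must be 'direct' or 'inverted'")


off = [("attB", "+"), ("promoter", "-"), ("attP", "-"), ("coding sequence", "+")]
on, _ = rearrange(off, 0, 2, "inverted")
print(on)

active = [("attB", "+"), ("promoter", "+"), ("attP", "+"), ("coding sequence", "+")]
silenced, circle = rearrange(active, 0, 2, "direct")
print(silenced, circle)
```

In the first case the promoter ends up on the same strand as the coding sequence; in the second it is removed from the parent molecule altogether.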



The methods of the invention are also useful for obtaining translocations of
chromosomes, for example. In these embodiments, one recombination site is placed on one
chromosome and a second recombination site that can serve as a substrate for recombination
with the first recombination site is placed on a second chromosome. Upon contacting the two
recombination sites with a recombinase, recombination occurs that results in swapping of the
two chromosome arms. For example, one can construct two strains of an organism, one
strain of which includes the first recombination site and the second strain that contains the
second recombination site. The two strains are then crossed, to obtain a progeny strain that
includes both of the recombination sites. Upon contacting the sites with the recombinase,
chromosome arm swapping occurs.

Recombinases and Recombination Sites

The methods of the invention use recombinase systems to achieve stable
integration or other rearrangement of nucleic acids in eukaryotic cells. A recombinase
system typically consists of three elements: two specific DNA sequences ("the
recombination sites") and a specific enzyme ("the recombinase"). The recombinase catalyzes
a recombination reaction between the specific recombination sites.

Recombination sites have an orientation. In other words, they are not
palindromes. The orientation of the recombination sites in relation to each other determines
what recombination event takes place. The recombination sites may be in two different
orientations: parallel (same direction) or opposite. When the recombination sites are present
on a single nucleic acid molecule and are in a parallel orientation to each other, then the
recombination event catalyzed by the recombinase is typically an excision of the
intervening nucleic acid, leaving a single recombination site. When the recombination sites
are in the opposite orientation, then any intervening sequence is typically inverted.

The recombinases used in the methods of the invention can mediate
site-specific recombination between a first recombination site and a second recombination site
that can serve as a substrate for recombination with the first recombination site. However, in
the absence of an additional factor that is not normally present in eukaryotic cells, they cannot
mediate recombination between two hybrid recombination sites that are formed upon
recombination between the first recombination site and the second recombination site.
Examples of these recombinases include, for example, the bacteriophage ΦC31 integrase
(see, e.g., Thorpe & Smith (1998) Proc. Natl. Acad. Sci. USA 95: 5505-5510; Kuhstoss &
Rao (1991) J. Mol. Biol. 222: 897-890; US Patent No. 5,190,871), a phage P4 recombinase
(Ow & Ausubel (1983) J. Bacteriol. 155: 704-713), a Listeria phage recombinase, a
bacteriophage R4 Sre recombinase (Matsuura et al. (1996) J. Bacteriol. 178: 3374-3376), a
CisA recombinase (Sato et al. (1990) J. Bacteriol. 172: 1092-1098; Stragier et al. (1989)
Science 243: 507-512), an XisF recombinase (Carrasco et al. (1994) Genes Dev. 8: 74-83),
and a transposon Tn4451 TnpX recombinase (Bannam et al. (1995) Mol. Microbiol. 16: 535-551;
Crellin & Rood (1997) J. Bacteriol. 179: 5148-5156).



Recombinase polypeptides, and nucleic acids that encode the recombinase
polypeptides, are described in the art and can be obtained using routine methods. For
example, a vector that includes a nucleic acid fragment that encodes the ΦC31 integrase is
described in US Patent No. 5,190,871 and is available from the Northern Regional Research
Laboratories, Peoria, Illinois 61604, under the accession number B-18477.

The recombinases can be introduced into the eukaryotic cells that contain the
recombination sites at which recombination is desired by any suitable method. For example,
one can introduce the recombinase in polypeptide form, e.g., by microinjection or other
methods. In presently preferred embodiments, however, a gene that encodes the recombinase
is introduced into the cells. Expression of the gene results in production of the recombinase,
which then catalyzes recombination among the corresponding recombination sites. One can
introduce the recombinase gene into the cell before, after, or simultaneously with, the
introduction of the exogenous polynucleotide of interest. In one embodiment, the
recombinase gene is present within the vector that carries the polynucleotide that is to be
inserted; the recombinase gene can even be included within the polynucleotide. In other
embodiments, the recombinase gene is introduced into a transgenic eukaryotic organism,
e.g., a transgenic plant, animal, fungus, or the like, which is then crossed with an organism
that contains the corresponding recombination sites.
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Target Organisms 

The methods of the invention are useful for obtaining stable integration
and/or rearrangement of DNA in any type of eukaryotic cell. For example, the methods are
useful for cells of animals, plants, fungi, bacteria and other microorganisms. In some
embodiments, the cells are part of a multicellular organism, e.g., a transgenic plant or
animal. The methods of the invention are particularly useful in situations where transgenic
materials are difficult to obtain, such as with transgenic wheat, corn, and animals. In these
situations, finding the rare single copy insertion requires the prior attainment of a large
number of independently derived transgenic clones, which itself requires great expenditure
of effort.

Among the plant targets of particular interest are monocots, including, for
example, rice, corn, wheat, rye, barley, bananas, palms, lilies, orchids, and sedges. Dicots are
also suitable targets, including, for example, tobacco, apples, potatoes, beets, carrots,
willows, elms, maples, roses, buttercups, petunias, phloxes, violets and sunflowers. Other
targets include animal and fungal cells. These lists are merely illustrative and not limiting.

Constructs for Introduction of Exogenous DNA into Target Cells 

The methods of the invention often involve the introduction of exogenous
DNA into target cells. For example, nucleic acids that include one or more recombination
sites are often introduced into the cells. The polynucleotide constructs that are to be
introduced into the cells can include, in addition to the recombination site or sites, a gene or
other functional sequence that will confer a desired phenotype on the cell.



In some embodiments, a polynucleotide construct that encodes a recombinase
is introduced into the eukaryotic cells in addition to the recombination sites. The
recombinase-encoding polynucleotide can be included on the same nucleic acid as the
recombination site or sites, or can be introduced into the cell as a separate nucleic acid. The
present invention provides nucleic acids that include recombination sites, as well as nucleic
acids in which a recombinase-encoding polynucleotide sequence is operably linked to a
promoter that functions in the target eukaryotic cell.

Generally, a polynucleotide that is to be expressed (e.g., a recombinase-encoding
polynucleotide or transgene of interest) will be present in an expression cassette,
meaning that the polynucleotide is operably linked to expression control signals, e.g.,
promoters and terminators, that are functional in the host cell of interest. The genes that
encode the recombinase and the selectable marker will also be under the control of such
signals that are functional in the host cell. Control of expression is most easily achieved by
selection of a promoter. The transcription terminator is not generally as critical and a variety
of known elements may be used so long as they are recognized by the cell.

A promoter can be derived from a gene that is under investigation, or can be a
heterologous promoter that is obtained from a different gene, or from a different species.
Where direct expression of a gene in all tissues of a transgenic plant or other organism is
desired, one can use a "constitutive" promoter, which is generally active under most
environmental conditions and states of development or cell differentiation. Suitable
constitutive promoters for use in plants include, for example, the cauliflower mosaic virus
(CaMV) 35S transcription initiation region and region VI promoters, the 1'- or 2'-promoter
derived from T-DNA of Agrobacterium tumefaciens, and other promoters active in plant
cells that are known to those of skill in the art. Other suitable promoters include the
full-length transcript promoter from Figwort mosaic virus, actin promoters, histone promoters,
tubulin promoters, or the mannopine synthase promoter (MAS). Other constitutive plant
promoters include various ubiquitin or polyubiquitin promoters derived from, inter alia,
Arabidopsis (Sun and Callis, Plant J., 11(5):1017-1027 (1997)), the mas, Mac or DoubleMac
promoters (described in United States Patent No. 5,106,739 and by Comai et al., Plant Mol.
Biol. 15:373-381 (1990)) and other transcription initiation regions from various plant genes
known to those of skill in the art. Such genes include, for example, ACT11 from Arabidopsis
(Huang et al., Plant Mol. Biol. 33:125-139 (1996)), Cat3 from Arabidopsis (GenBank No.
U43147, Zhong et al., Mol. Gen. Genet. 251:196-203 (1996)), the gene encoding stearoyl-acyl
carrier protein desaturase from Brassica napus (GenBank No. X74782, Solocombe et
al., Plant Physiol. 104:1167-1176 (1994)), GPc1 from maize (GenBank No. X15596,
Martinez et al., J. Mol. Biol. 208:551-565 (1989)), and Gpc2 from maize (GenBank No.
U45855, Manjunath et al., Plant Mol. Biol. 33:97-112 (1997)). Useful promoters for plants
also include those obtained from Ti- or Ri-plasmids, from plant cells, plant viruses or other
hosts where the promoters are found to be functional in plants. Bacterial promoters that
function in plants, and thus are suitable for use in the methods of the invention, include the
octopine synthetase promoter, the nopaline synthase promoter, and the mannopine synthetase
promoter. Suitable endogenous plant promoters include the ribulose-1,5-bisphosphate
(RUBP) carboxylase small subunit (ssu) promoter, the β-conglycinin promoter, the
phaseolin promoter, the ADH promoter, and heat-shock promoters.

Control sequences for use in E. coli include the T7, trp, or lambda promoters, a
ribosome binding site and preferably a transcription termination signal. For eukaryotic cells,
the control sequences typically include a promoter which optionally includes an enhancer
derived from immunoglobulin genes, SV40, cytomegalovirus, etc., and a polyadenylation
sequence, and may include splice donor and acceptor sequences. In yeast, convenient
promoters include GAL1-10 (Johnson and Davies (1984) Mol. Cell. Biol. 4:1440-1448),
ADH2 (Russell et al. (1983) J. Biol. Chem. 258:2674-2682), PHO5 (EMBO J. (1982) 6:675-680),
and MFα (Herskowitz and Oshima (1982) in The Molecular Biology of the Yeast
Saccharomyces (eds. Strathern, Jones, and Broach) Cold Spring Harbor Lab., Cold Spring
Harbor, N.Y., pp. 181-209).

Alternatively, one can use a promoter that directs expression of a gene of
interest in a specific tissue or is otherwise under more precise environmental or
developmental control. Such promoters are referred to here as "inducible" or "repressible"
promoters. Examples of environmental conditions that may affect transcription by inducible
promoters include pathogen attack, anaerobic conditions, ethylene or the presence of light.
Promoters under developmental control include promoters that initiate transcription only in
certain tissues, such as leaves, roots, fruit, seeds, or flowers. The operation of a promoter
may also vary depending on its location in the genome. Thus, an inducible promoter may
become fully or partially constitutive in certain locations. Inducible promoters are often used
to control expression of the recombinase gene, thus allowing one to control the timing of the
recombination reaction. Examples of tissue-specific plant promoters under developmental
control include promoters that initiate transcription only in certain tissues, such as fruit,
seeds, or flowers. The tissue-specific E8 promoter from tomato is particularly useful for
directing gene expression so that a desired gene product is located in fruits. See, e.g., Lincoln
et al. (1988) Proc. Nat'l Acad. Sci. USA 84: 2793-2797; Deikman et al. (1988) EMBO J. 7:
3315-3320; Deikman et al. (1992) Plant Physiol. 100: 2013-2017. Other suitable promoters
include those from genes encoding embryonic storage proteins. Examples of environmental
conditions that may affect transcription by inducible promoters include anaerobic conditions,
elevated temperature, or the presence of light. Additional organ-specific, tissue-specific
and/or inducible foreign promoters are also known (see, e.g., references cited in Kuhlemeier
et al. (1987) Ann. Rev. Plant Physiol. 38:221), including those of the 1,5-ribulose bisphosphate
carboxylase small subunit genes of Arabidopsis thaliana (the "ssu" promoter), which are
light-inducible and active only in photosynthetic tissue, anther-specific promoters (EP
344029), and seed-specific promoters of, for example, Arabidopsis thaliana (Krebbers et al.
(1988) Plant Physiol. 87:859). Exemplary green tissue-specific promoters include the maize
phosphoenol pyruvate carboxylase (PEPC) promoter, small subunit ribulose bis-carboxylase
promoters (ssRUBISCO) and the chlorophyll a/b binding protein promoters. The promoter
may also be a pith-specific promoter, such as the promoter isolated from a plant TrpA gene
as described in International Publication No. WO93/07278.

Inducible promoters for other organisms include, for example, the arabinose
promoter, the lacZ promoter, the metallothionein promoter, and the heat shock promoter, as
well as many others that are known to those of skill in the art. An example of a repressible
promoter useful in yeasts such as S. pombe is the Pmnt promoter, which is repressible by
vitamin B1.

Typically, constructs to be introduced into these cells are prepared using
recombinant expression techniques. Recombinant expression techniques involve the
construction of recombinant nucleic acids and the expression of genes in transfected cells.
Molecular cloning techniques to achieve these ends are known in the art. A wide variety of
cloning and in vitro amplification methods suitable for the construction of recombinant
nucleic acids are well-known to persons of skill. Examples of these techniques and
instructions sufficient to direct persons of skill through many cloning exercises are found in
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology,
Volume 152, Academic Press, Inc., San Diego, CA (Berger); and Current Protocols in
Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1998 Supplement)
(Ausubel).

The construction of polynucleotide constructs generally requires the use of
vectors able to replicate in bacteria. A plethora of kits are commercially available for the
purification of plasmids from bacteria. For their proper use, follow the manufacturer's
instructions (see, for example, EasyPrep™, FlexiPrep™, both from Pharmacia Biotech;
StrataClean™, from Stratagene; and QIAexpress Expression System, Qiagen). The isolated
and purified plasmids can then be further manipulated to produce other plasmids, used to
transfect cells or incorporated into Agrobacterium tumefaciens to infect and transform
plants. Where Agrobacterium is the means of transformation, shuttle vectors are constructed.
Cloning in Streptomyces or Bacillus is also possible.

Selectable markers are often incorporated into the polynucleotide constructs
and/or into the vectors that are used to introduce the constructs into the target cells. These
markers permit the selection of colonies of cells containing the polynucleotide of interest.
Often, the vector will have one selectable marker that is functional in, e.g., E. coli, or other
cells in which the vector is replicated prior to being introduced into the target cell. Examples
of selectable markers for E. coli include genes specifying resistance to antibiotics such as
ampicillin, tetracycline, kanamycin, or erythromycin, and genes conferring other types of
selectable enzymatic activities such as β-galactosidase, or the lactose operon. Suitable
selectable markers for use in mammalian cells include, for example, the dihydrofolate
reductase gene (DHFR), the thymidine kinase gene (TK), or prokaryotic genes conferring
drug resistance: gpt (xanthine-guanine phosphoribosyltransferase), which can be selected for
with mycophenolic acid; neo (neomycin phosphotransferase), which can be selected for with
G418, hygromycin, or puromycin; and DHFR (dihydrofolate reductase), which can be
selected for with methotrexate (Mulligan & Berg (1981) Proc. Nat'l. Acad. Sci. USA 78:
2072; Southern & Berg (1982) J. Mol. Appl. Genet. 1: 327).

Selection markers for plant cells often confer resistance to a biocide or an
antibiotic, such as, for example, kanamycin, G418, bleomycin, hygromycin, or
chloramphenicol, or herbicide resistance, such as resistance to chlorsulfuron or Basta.
Examples of suitable coding sequences for selectable markers are: the neo gene, which codes
for the enzyme neomycin phosphotransferase, which confers resistance to the antibiotic
kanamycin (Beck et al. (1982) Gene 19:327); the hyg (hpt) gene, which codes for the enzyme
hygromycin phosphotransferase and confers resistance to the antibiotic hygromycin (Gritz
and Davies (1983) Gene 25:179); and the bar gene (EP 242236), which codes for
phosphinothricin acetyl transferase, which confers resistance to the herbicidal compounds
phosphinothricin and bialaphos.

If more than one exogenous nucleic acid is to be introduced into a target
eukaryotic cell, it is generally desirable to use a different selectable marker on each
exogenous nucleic acid. This allows one to simultaneously select for cells that contain both
of the desired exogenous nucleic acids.
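
The marker-assignment rule above can be checked mechanically when several constructs are planned. The following is a minimal sketch under the assumption that each construct is recorded with a single selectable marker; the function name `shared_markers` is hypothetical. The two plasmid and marker pairings shown are those of Example 1 (pLT45 carries ura4+ and pLT43 carries his3+).

```python
from collections import Counter


def shared_markers(constructs: dict) -> list:
    """Return markers that are used by more than one construct.

    `constructs` maps a construct name to the name of its selectable marker.
    An empty result means every construct can be selected for independently.
    """
    counts = Counter(constructs.values())
    return sorted(marker for marker, n in counts.items() if n > 1)


# Marker assignments taken from Example 1 of this document.
planned = {"pLT45": "ura4+", "pLT43": "his3+"}
print(shared_markers(planned))  # an empty list: the two markers differ
```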

Methods for Introducing Constructs into Target Cells 

The polynucleotide constructs that include recombination sites and/or
recombinase-encoding genes can be introduced into the target cells and/or organisms by any
of the several means known to those of skill in the art. For instance, the DNA constructs can
be introduced into plant cells, either in culture or in the organs of a plant, by a variety of
conventional techniques. For example, the DNA constructs can be introduced directly to
plant cells using biolistic methods, such as DNA particle bombardment, or the DNA
construct can be introduced using techniques such as electroporation and microinjection of
plant cell protoplasts. Particle-mediated transformation techniques (also known as
"biolistics") are described in Klein et al., Nature, 327:70-73 (1987); Vasil, V. et al.,
Bio/Technol. 11:1553-1558 (1993); and Becker, D. et al., Plant J., 5:299-307 (1994). These
methods involve penetration of cells by small particles with the nucleic acid either within the
matrix of small beads or particles, or on the surface. The biolistic PDS-1000 Gene Gun
(Biorad, Hercules, CA) uses helium pressure to accelerate DNA-coated gold or tungsten
microcarriers toward target cells. The process is applicable to a wide range of tissues and
cells from organisms, including plants, bacteria, fungi, algae, intact animal tissues, tissue
culture cells, and animal embryos. One can employ electronic pulse delivery, which is
essentially a mild electroporation format for live tissues in animals and patients. Zhao,
Advanced Drug Delivery Reviews 17:257-262 (1995).

Other transformation methods are also known to those of skill in the art.
Microinjection techniques are known in the art and well described in the scientific and patent
literature. The introduction of DNA constructs using polyethylene glycol (PEG) precipitation
is described in Paszkowski et al., EMBO J. 3:2717 (1984). Electroporation techniques are
described in Fromm et al., Proc. Natl. Acad. Sci. USA, 82:5824 (1985). PEG-mediated
transformation and electroporation of plant protoplasts are also discussed in Lazzeri, P.,
Methods Mol. Biol. 49:95-106 (1995). Methods are known for introduction and expression of
heterologous genes in both monocot and dicot plants. See, e.g., US Patent Nos. 5,633,446,
5,317,096, 5,689,052, 5,159,135, and 5,679,558; Weising et al. (1988) Ann. Rev. Genet.
22:421-477. Transformation of monocots in particular can use various techniques including
electroporation (e.g., Shimamoto et al., Nature (1992), 338:274-276); biolistics (e.g.,
European Patent Application 270,356); and Agrobacterium (e.g., Bytebier et al., Proc. Nat'l
Acad. Sci. USA (1987) 84:5345-5349).

For transformation of plants, DNA constructs may be combined with suitable
T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host
vector. The virulence functions of the A. tumefaciens host will direct the insertion of a
transgene and adjacent marker gene(s) (if present) into the plant cell DNA when the cell is
infected by the bacteria. Agrobacterium tumefaciens-mediated transformation techniques
are well described in the scientific literature. See, for example, Horsch et al., Science,
233:496-498 (1984), Fraley et al., Proc. Natl. Acad. Sci. USA, 80:4803 (1983), and
Hooykaas, Plant Mol. Biol., 13:327-336 (1989), Bechtold et al., Comptes Rendus De
L'Academie Des Sciences Serie III-Sciences De La Vie-Life Sciences, 316:1194-1199 (1993),
Valvekens et al., Proc. Natl. Acad. Sci. USA, 85:5536-5540 (1988). For a review of gene
transfer methods for plant and cell cultures, see Fisk et al., Scientia Horticulturae 55:5-36
(1993) and Potrykus, CIBA Found. Symp. 154:198 (1990).

Other methods for delivery of polynucleotide sequences into cells include, for
example, liposome-based gene delivery (Debs and Zhu (1993) WO 93/24640; Mannino and
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose, U.S. Pat. No. 5,279,833; Brigham
(1991) WO 91/06309; and Felgner et al. (1987) Proc. Natl. Acad. Sci. USA 84: 7413-7414),
as well as use of viral vectors (e.g., adenoviral (see, e.g., Berns et al. (1995) Ann. NY Acad.
Sci. 772: 95-104; Ali et al. (1994) Gene Ther. 1: 367-384; and Haddada et al. (1995) Curr.
Top. Microbiol. Immunol. 199 (Pt 3): 297-306 for review), papillomaviral, retroviral (see,
e.g., Buchscher et al. (1992) J. Virol. 66(5): 2731-2739; Johann et al. (1992) J. Virol.
66(5): 1635-1640; Sommerfelt et al. (1990) Virol. 176: 58-59; Wilson et al. (1989) J.
Virol. 63: 2374-2378; Miller et al., J. Virol. 65: 2220-2224 (1991); Wong-Staal et al.,
PCT/US94/05700; Rosenburg and Fauci (1993) in Fundamental Immunology, Third
Edition, Paul (ed.), Raven Press, Ltd., New York, and the references therein; and Yu et al.,
Gene Therapy (1994) supra), and adeno-associated viral vectors (see West et al. (1987)
Virology 160: 38-47; Carter et al. (1989) U.S. Patent No. 4,797,368; Carter et al., WO
93/24641 (1993); Kotin (1994) Human Gene Therapy 5: 793-801; Muzyczka (1994) J. Clin.
Invst. 94: 1351 and Samulski (supra) for an overview of AAV vectors; see also Lebkowski,
U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5(11): 3251-3260; Tratschin
et al. (1984) Mol. Cell. Biol. 4: 2072-2081; Hermonat and Muzyczka (1984) Proc. Natl.
Acad. Sci. USA 81: 6466-6470; McLaughlin et al. (1988) and Samulski et al. (1989)
J. Virol. 63: 03822-3828)), and the like.

Methods by which one can analyze the integration pattern of the introduced
exogenous DNA are well known to those of skill in the art. For example, one can extract
DNA from the transformed cells, digest the DNA with one or more restriction enzymes, and
hybridize to a labeled fragment of the polynucleotide construct. The inserted sequence can
also be identified using the polymerase chain reaction (PCR). See, e.g., Sambrook et al.,
Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring
Harbor, New York, 1989 for descriptions of these and other suitable methods.

Regeneration of Transgenic Plants and Animals

The methods of the invention are particularly useful for obtaining transgenic
and chimeric multicellular organisms that have a stably integrated exogenous polynucleotide
or other stable rearrangement of cellular nucleic acids. Methods for obtaining transgenic and
chimeric organisms, both plants and animals, are well known to those of skill in the art.

Transformed plant cells, derived by any of the above transformation
techniques, can be cultured to regenerate a whole plant which possesses the transformed
genotype and thus the desired phenotype. Such regeneration techniques rely on
manipulation of certain phytohormones in a tissue culture growth medium, typically relying
on a biocide and/or herbicide marker which has been introduced together with the desired
nucleotide sequences. Plant regeneration from cultured protoplasts is described in Evans et
al., Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp. 124-176,
Macmillan Publishing Company, New York (1983); and in Binding, Regeneration of
Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, (1985). Regeneration can also
be obtained from plant callus, explants, somatic embryos (Dandekar et al., J. Tissue Cult.
Meth., 12:145 (1989); McGranahan et al., Plant Cell Rep., 8:512 (1990)), organs, or parts
thereof. Such regeneration techniques are described generally in Klee et al., Ann. Rev. of
Plant Phys., 38:467-486 (1987).

The methods are useful for producing transgenic and chimeric animals of
most vertebrate species. Such species include, but are not limited to, nonhuman mammals,
including rodents such as mice and rats, rabbits, ovines such as sheep and goats, porcines
such as pigs, and bovines such as cattle and buffalo. Methods of obtaining transgenic
animals are described in, for example, Puhler, A., Ed., Genetic Engineering of Animals,
VCH Publ., 1993; Murphy and Carter, Eds., Transgenesis Techniques: Principles and
Protocols (Methods in Molecular Biology, Vol. 18), 1993; and Pinkert, CA, Ed., Transgenic
Animal Technology: A Laboratory Handbook, Academic Press, 1994. Transgenic fish
having specific genetic modifications can also be made using the claimed methods. See,
e.g., Iyengar et al. (1996) Transgenic Res. 5: 147-166 for general methods of making
transgenic fish.

One method of obtaining a transgenic or chimeric animal having specific
modifications in its genome is to contact fertilized oocytes with a vector that includes the
polynucleotide of interest flanked by recombination sites. For some animals, such as mice,
fertilization is performed in vivo and fertilized ova are surgically removed. In other animals,
particularly bovines, it is preferable to remove ova from live or slaughterhouse animals and
fertilize the ova in vitro. See DeBoer et al., WO 91/08216. In vitro fertilization permits the
modifications to be introduced into substantially synchronous cells. Fertilized oocytes are
then cultured in vitro until a pre-implantation embryo is obtained containing about 16-150
cells. The 16-32 cell stage of an embryo is described as a morula. Pre-implantation
embryos containing more than 32 cells are termed blastocysts. These embryos show the
development of a blastocoel cavity, typically at the 64 cell stage. If desired, the presence of
a desired exogenous polynucleotide in the embryo cells can be detected by methods known
to those of skill in the art. Methods for culturing fertilized oocytes to the pre-implantation
stage are described by Gordon et al. (1984) Methods Enzymol. 101: 414; Hogan et al.,
Manipulation of the Mouse Embryo: A Laboratory Manual, C.S.H.L. N.Y. (1986) (mouse
embryo); Hammer et al. (1985) Nature 315: 680 (rabbit and porcine embryos); Gandolfi et
al. (1987) J. Reprod. Fert. 81: 23-28; Rexroad et al. (1988) J. Anim. Sci. 66: 947-953 (ovine
embryos) and Eyestone et al. (1989) J. Reprod. Fert. 85: 715-720; Camous et al. (1984) J.
Reprod. Fert. 72: 779-785; and Heyman et al. (1987) Theriogenology 27: 5968 (bovine
embryos). Sometimes pre-implantation embryos are stored frozen for a period pending
implantation. Pre-implantation embryos are transferred to an appropriate female, resulting in
the birth of a transgenic or chimeric animal depending upon the stage of development when
the transgene is integrated. Chimeric mammals can be bred to form true germline transgenic
animals.

Alternatively, the methods can be used to obtain embryonic stem cells (ES)
that have a single copy of the desired exogenous polynucleotide. These cells are obtained
from preimplantation embryos cultured in vitro. See, e.g., Hooper, ML, Embryonal Stem
Cells: Introducing Planned Changes into the Animal Germline (Modern Genetics, v. 1),
Int'l. Pub. Distrib., Inc., 1993; Bradley et al. (1984) Nature 309, 255-258. Transformed ES
cells are combined with blastocysts from a non-human animal. The ES cells colonize the
embryo and in some embryos form the germ line of the resulting chimeric animal. See
Jaenisch, Science, 240: 1468-1474 (1988). Alternatively, ES cells or somatic cells that can
reconstitute an organism ("somatic repopulating cells") can be used as a source of nuclei for
transplantation into an enucleated fertilized oocyte, giving rise to a transgenic mammal. See,
e.g., Wilmut et al. (1997) Nature 385: 810-813.

EXAMPLES 

The following examples are offered to illustrate, but not to limit, the present
invention.

Example 1 

The ΦC31 Recombination System Functions in Schizosaccharomyces pombe

This Example demonstrates that the Streptomyces bacteriophage ΦC31 site-specific
recombination system functions in eukaryotic cells. A bacteriophage attachment site
(attP) was introduced into a chromosome of Schizosaccharomyces pombe at the S. pombe
leu1 locus. This target strain was subsequently transformed with a plasmid that contains the
bacterial attachment site (attB) linked to a ura4+ selectable marker. When co-transformed
with a second plasmid harboring the ΦC31 integrase gene, high efficiency transformation to
Ura+ was observed under conditions where the integrase gene was expressed.
Southern analysis of the integration events shows insertion of the attB-ura4+
plasmid into the attP site of the leu1 locus. Nucleotide sequence of the hybrid junctions
revealed that the attB x attP recombination reaction is precise.

Materials and Methods 

Recombinant DNA

Standard methods were used throughout. E. coli strain XL2-Blue (recA1
endA1 gyrA96 thi-1 hsdR17 supE44 relA1 lac [F' proAB lacIqZΔM15 Tn10 (Tetr) Amy
Camr], Stratagene) served as host for DNA constructs.

Media 

Fission yeast strains were grown on minimal medium (EMM-low glucose,
from Bio101) supplemented as needed with 225 mg/l adenine, histidine, leucine or uracil.
Minimal plates with 5-FOA (5-fluoroorotic acid, from Zymo Research, Inc.) were prepared
according to Grimm et al. ((1988) Mol. Gen. Genet. 215: 81-86) and were supplemented
with adenine, histidine, and leucine. When used, thiamine was added to 5 µg/ml.
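
The supplement concentrations stated above translate directly into amounts per batch of medium. The following minimal sketch shows the arithmetic; the function names and the 500 ml batch volume are illustrative assumptions, not part of the protocol.

```python
SUPPLEMENT_MG_PER_L = 225.0   # adenine, histidine, leucine or uracil, as stated above
THIAMINE_UG_PER_ML = 5.0      # thiamine, when used, as stated above


def supplement_mg(volume_ml: float) -> float:
    """Milligrams of one amino acid or base supplement for a batch of medium."""
    return SUPPLEMENT_MG_PER_L * volume_ml / 1000.0


def thiamine_ug(volume_ml: float) -> float:
    """Micrograms of thiamine for a batch of medium."""
    return THIAMINE_UG_PER_ML * volume_ml


batch_ml = 500.0  # hypothetical batch volume
print(supplement_mg(batch_ml), "mg of each supplement")
print(thiamine_ug(batch_ml), "ug of thiamine")
```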

S. pombe with ΦC31 attP target

The 84 bp ΦC31 attP site (abbreviated as PP'), isolated as an ApaI-SacI
fragment from pHS282 (Thorpe & Smith (1998) Proc. Natl. Acad. Sci. USA 95:5505-5510),
was cloned into the same sites of the S. pombe integrating vector pJK148 (Keeney & Boeke
(1994) Genetics 136:849-856) to make pLT44. This plasmid was targeted to the S. pombe
leu1-32 allele by lithium acetate mediated transformation with NdeI cut DNA. The recipient
host FY527 (h- ade6-M216 his3-D1 leu1-32 ura4-D18), converted to Leu+ by homologous
recombination with pLT44, was examined by Southern analysis. One Leu+ transformant,
designated FY527attP, was found to contain a single copy of pLT44. Another transformant,
designated FY527attPx2, harbors a tandem plasmid insertion.

Integrative ura4+ vector with ΦC31 attB site

The S. pombe ura4+ gene, excised from pTZura4 (S. Forsburg) on a 1.8 kb
EcoRI-BamHI fragment, was inserted into pJK148 cut with the same enzymes to create
pLT40. The ΦC31 attB site (abbreviated as BB'), isolated from pHS21 as a 500 bp BamHI-XbaI
fragment, was ligated into pLT40 cut with those enzymes, creating pLT42. Most of the
leu1 gene was removed from pLT42 by deleting an XhoI fragment to create pLT45. This
removed all but 229 bp of leu1 from pLT45 and reduced its transformation efficiency to that
of a plasmid without any leu1 homology. pLT50, which has a second attB site in the same
orientation immediately on the other side of ura4, was constructed by first subcloning the
attB BamHI-SacI fragment from pLT42 into pUC19, excising it with EcoRI and SalI, and
subsequently inserting it into pLT45 cut with EcoRI and XhoI. The second attB site in the
final construct was sequenced once on each strand and found to be identical to the first attB
site.

Linear DNA transformation

The attB-ura4+-attB linear DNA was prepared as an AatII-AlwNI fragment
purified from pLT50, or as a PCR product using pLT50 as template. PCR was conducted
using standard conditions with a T3 primer and a second primer (5' ggc cct gaa att gtt gct tct
gcc 3') corresponding to the plasmid backbone of pJK148.

Repressible synthesis of ΦC31 integrase

The S. pombe Pmnt promoter, repressible by vitamin B1, was excised as a 1.2
kb PstI-SacI fragment from pMO147 and inserted into the his3+, ars1 vector pBG2 (Ohi et
al. (1996) Gene 174: 315-318) cut with the same enzymes, creating pLT41. A 2.0 kb SacI
fragment containing the ΦC31 int coding region was transferred from pHS33 (Thorpe &
Smith (1998) supra) to the SacI site of pLT41. A clone in which the int coding region is
oriented such that expression is under the control of Pmnt was designated pLT43.

Molecular analyses 

Southern analysis was performed using the Genius™ system from Boehringer
Mannheim. A 998 bp internal EcoRV fragment of leu1, a 1.8 kb fragment of ura4, and the
2.0 kb ΦC31 int gene were digoxigenin-labeled by the random primer method and used as
probes. Polymerase chain reaction was performed on a Perkin Elmer Cetus GeneAmp PCR
9600 using Stratagene Turbo PFU enzyme or VENT polymerase. The standard T3 and T7
primers were used where possible. The ura4 primer (5' gtc aaa aag ttt cgt caa tat cac 3'
(SEQ ID NO: 1)) and the pJK148 primers were purchased from Operon Technologies. For
all PCR reactions an annealing temperature of 51°C and a 30-second extension time were
used.
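
For orientation, the basic properties of the ura4 primer (SEQ ID NO: 1) can be computed as in the sketch below. The melting temperature uses the Wallace rule, Tm = 2(A+T) + 4(G+C), which is only a rough rule of thumb intended for short oligonucleotides. Its use here is an assumption made for illustration and is not a method described in this Example; the function names are hypothetical.

```python
URA4_PRIMER = "gtc aaa aag ttt cgt caa tat cac"  # SEQ ID NO: 1, written 5' to 3'


def clean(seq: str) -> str:
    """Remove spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_count(seq: str) -> int:
    """Number of G and C residues in the sequence."""
    s = clean(seq)
    return s.count("G") + s.count("C")


def wallace_tm(seq: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule."""
    s = clean(seq)
    gc = gc_count(s)
    at = len(s) - gc
    return 2 * at + 4 * gc


primer = clean(URA4_PRIMER)
print(len(primer), "nt;", gc_count(primer), "G+C;", wallace_tm(primer), "C (rough estimate)")
```

Annealing temperatures are commonly set several degrees below such an estimate, and the value that works in practice is determined empirically.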

Results and Discussion 

Inserting a target site into the S. pombe genome

To create a host strain with a target site for ΦC31-mediated integration, the
ΦC31 attP site was inserted by homologous recombination into the leu1 locus of the fission
yeast genome to form the Leu+ strain FY527attP (Figure 1A). Previous studies showed that
when S. pombe DNA is cleaved with XbaI and probed with an internal 1 kb fragment of the
leu1+ gene, the probe detects a 14 kb band (Keeney & Boeke (1994) Genetics 136: 849-856).
Insertion of the leu1+ plasmid pJK148 at the leu1-32 locus results in detection of 3 and 18 kb
bands (Figure 1A). Since pLT44 differs from pJK148 by the inclusion of an 84 bp ΦC31
attP element, integration of pLT44 at leu1-32 yielded the same 3 kb and 18 kb hybridization
pattern in FY527attP. The absence of other hybridizing fragments indicates that the pLT44
DNA resides as a single integrated copy.

ΦC31-integrase-mediated transformation

FY527attP was transformed with pLT45, which harbors ura4+ and an attB
sequence (BB') but lacks an origin of replication. This construct was introduced by itself or
with pLT43, a his3+ replicating vector that produces ΦC31 integrase. The inclusion of
pLT43 increased the number of Ura+ transformants an average of 15 fold (Table 1). This
enhancement cannot be attributed to the recombination between pLT45 and the replication-
proficient pLT43, as its effect is dependent on integrase gene expression. Transcription of
the integrase gene is under the control of Pnmt, a promoter repressible by high levels of
vitamin B1 (thiamine) (Maundrell, K. (1993) Gene 123: 127-130). The repression is not absolute
(Forsburg, S. L. (1993) Nucleic Acids Res. 21: 2955-2956) but reduces the production of
integrase protein. When thiamine was added to the growth medium, the number of Ura+
transformants decreased to near background level. The frequency of Ura+ transformants did
not change significantly whether or not the integrase plasmid was co-selected by omission of
histidine from the medium. The transformation competency of FY527attP was estimated
from the number of His+ transformants obtained with pLT43 or its progenitor plasmid pBG2.
Compared to the frequency of either replicating plasmid, the pLT43-dependent
transformation of FY527attP averaged about 15%.



Table 1: Integrase-dependent site-specific insertion in S. pombe FY527attP.

DNA            Selection  B1   Transformants per      Relative  Class a  Class b  Others
                               10^[?] cells (±sd)*    Value†
pLT43          His+            7200 (±2200)           100
pLT45          Ura+            63 (±10)               1         0%§      0%§      100%§
pLT45 + pLT43  Ura+       -    1100 (±120)            15        88%‡     6%‡      6%‡
pLT45 + pLT43  Ura+       +    120 (±16)              2         0%§      25%§     75%§

*From three independent experiments
†(transformation efficiency of the DNA of interest)/(transformation efficiency of pLT43) x 100
‡n=16
§n=8
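
The Relative Value column follows directly from the footnote formula. As a minimal sketch, the calculation can be reproduced from the mean transformant counts in Table 1; the dictionary keys and the function name are illustrative labels chosen here, and the thiamine annotations on the two co-transformation rows are taken from the accompanying text rather than from a legible table cell.

```python
# Mean transformant counts copied from Table 1 (three independent experiments).
# The row labels are illustrative; they are not identifiers from the source.
TRANSFORMANTS = {
    "pLT43 (His+)": 7200,
    "pLT45 (Ura+)": 63,
    "pLT45 + pLT43 (Ura+, no thiamine)": 1100,
    "pLT45 + pLT43 (Ura+, thiamine added)": 120,
}


def relative_value(count, reference):
    """Efficiency of the DNA of interest as a percentage of the reference plasmid."""
    return 100.0 * count / reference


# pLT43 is the reference plasmid, as stated in the table footnote.
REFERENCE = TRANSFORMANTS["pLT43 (His+)"]
RELATIVE = {name: relative_value(n, REFERENCE) for name, n in TRANSFORMANTS.items()}

if __name__ == "__main__":
    for name, value in RELATIVE.items():
        print(f"{name}: {value:.1f}")
```

Rounded to whole numbers, the computed values agree with the tabulated 100, 1, 15 and 2.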



ΦC31-integrase promoted attP x attB recombination

Recombination between the pLT45-encoded ΦC31 attB element and the
chromosomally situated attP sequence would incorporate the circular DNA into the leu1
locus as depicted in Figure 1B. If this reaction occurs and XbaI-fractionated genomic DNA
from the Ura+ transformants is probed with leu1 DNA, the 3 kb band will remain unchanged,
while the 18 kb band will increase to ~23 kb (Figure 1C). Randomly selected Ura+ colonies
were examined by hybridization analysis. Of eight isolates derived from experiments where
ΦC31 integrase gene expression was derepressed by the omission of thiamine, seven showed
the presence of this ~23 kb band. This same size band hybridized to the ura4 probe. This
contrasts with the lack of ura4 hybridization with the parental strain, as expected from its
ura4-D18 deletion allele. One of these seven isolates showed additional bands hybridizing
to both probes. This candidate appears to have a DNA rearrangement at the leu1 locus in
addition to a site-specific recombination event. The leu1 rearrangement was probably
catalyzed by the operative S. pombe homologous recombination system. The remaining
isolate had not experienced a site-specific recombination event and appeared to have gained
uracil prototrophy by recombination between pLT45 and pLT43. Of these eight isolates,
half were selected as both Ura+ and His+, but no significant difference was found between
this group and the group selected for Ura+ only.
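
The band-pattern reasoning above can be written as a small decision rule. The sketch below is illustrative only: the function name is hypothetical, band sizes are treated as nominal integers in kb, and the class definitions (class a for the precise 3 kb + 23 kb pattern, class b for a 23 kb junction band accompanied by extra bands, others for everything else) are an interpretation of the text and of Table 1, not a procedure stated in the source.

```python
def classify_isolate(leu1_bands_kb, ura4_bands_kb):
    """Assign a Ura+ isolate to 'class a', 'class b' or 'others'.

    Arguments are the nominal sizes (kb) of XbaI bands detected by the
    leu1 probe and by the ura4 probe, respectively.
    """
    leu1 = set(leu1_bands_kb)
    ura4 = set(ura4_bands_kb)

    # Site-specific insertion shifts the 18 kb leu1 band to ~23 kb, and the
    # same band should also hybridize to the ura4 probe.
    has_junction = 23 in leu1 and 23 in ura4
    if not has_junction:
        return "others"

    # Precise single-copy integration: only the expected bands are present.
    if leu1 == {3, 23} and ura4 == {23}:
        return "class a"

    # Junction band present, but additional bands indicate a rearrangement.
    return "class b"


if __name__ == "__main__":
    print(classify_isolate([3, 23], [23]))
    print(classify_isolate([3, 23, 9], [23, 9]))
    print(classify_isolate([3, 18], []))
```

Real blots give approximate sizes, so a practical version would compare bands within a tolerance rather than by exact equality.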

From transformation experiments plated in the presence of vitamin B1, an
equal number of Ura+ transformants was examined by DNA hybridization. The thiamine-
repressible Pnmt promoter is expected to limit integrase production, and thereby site-specific
integration. Two of the eight Ura+ candidates isolated from this low frequency
transformation showed a band of 23 kb hybridizing to leu1 and to the ura4 probe. However,
since both probes detected an additional band, they do not represent correct integration
events, and we grouped them as class b integrants. In the other six isolates, the hybridization
patterns are difficult to interpret. In some of them, the 3 kb band was not detected by the
leu1 probe, as though the locus has experienced some rearrangement. In many of them, the
weak hybridization to ura4 suggests that the Ura+ phenotype may not be due to the stable
maintenance of pLT45 in the genome.

To ascertain the proportion of transformants maintaining the integrase
plasmid in the absence of selection, the blots were re-probed with the integrase gene
sequence. Those selected as Ura+ His+ would be expected to maintain the plasmid, and did
so, as the hybridization revealed. Five of the eight isolates selected as Ura+ without regard to
the His phenotype also gave bands hybridizing to the integrase probe. To confirm that loss
of int would not affect stable integration, another set of randomly chosen Ura+ cells were
grown non-selectively for a number of generations and screened for His- progeny that have
lost pLT43. The analysis of eight representative Ura+ His- clones showed that all had a
single copy of pLT45 precisely integrated at the chromosome-situated attP site. The DNA
of these integrants did not hybridize with the integrase probe. In contrast, the background
frequency Ura+ clones derived by transformation of pLT45 alone gave the parental
configuration of hybridizing bands at the leu1 locus and additional faint bands at 5 kb and 7
kb. These observations are consistent with either integration of pLT45 elsewhere in the
genome, or maintenance of the plasmid in some cells despite the lack of a S. pombe
replication origin.

Conservative site-specific recombination 

PCR was used to retrieve the attP/attB recombinant junctions from three
representative Ura+ candidates. One of the hybrid sites, attR (PB'), would be flanked by T3
and T7 promoters; the other site, attL (BP'), by the T3 promoter and ura4 DNA (Figure 1C).
In each case, primer pairs directed to these sequences amplified a band of the expected size
while the original attP (PP') was no longer found. This contrasts with the parental strain
FY527attP, where attP, but neither attL nor attR, was detected. The nucleotide sequence of
three representative attL and attR PCR products showed the absence of accompanying
mutations. Hence, as in bacteria and mammalian cells, ΦC31-mediated site-specific
recombination in S. pombe is a conservative recombination reaction.
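
The arm notation used above (PP', BB', PB', BP') lends itself to a simple symbolic model of the integration reaction. The following is a minimal sketch under stated assumptions: each att site is represented only by its two arms, the function names and the flank labels in the usage example are hypothetical, and which hybrid site ends up on which side of the insert depends on the relative orientation of the sites, which is fixed arbitrarily here.

```python
# Each att site is written as a pair of arms on either side of the crossover point.
ATT_P = ("P", "P'")
ATT_B = ("B", "B'")


def recombine(att_p, att_b):
    """Exchange arms at the crossover and return (attL, attR) = (BP', PB')."""
    att_l = (att_b[0], att_p[1])
    att_r = (att_p[0], att_b[1])
    return att_l, att_r


def integrate(chrom_left, chrom_right, plasmid_cargo):
    """Insert a circular attB plasmid at a chromosomal attP site.

    `plasmid_cargo` lists the rest of the circle in order, read starting just
    after attB. The result is a linear map of the recombinant locus in which
    the cargo is flanked by the two hybrid sites and no intact attP remains.
    """
    att_l, att_r = recombine(ATT_P, ATT_B)
    return (
        list(chrom_left)
        + ["attR(" + "".join(att_r) + ")"]
        + list(plasmid_cargo)
        + ["attL(" + "".join(att_l) + ")"]
        + list(chrom_right)
    )


if __name__ == "__main__":
    # Flank labels are placeholders for the chromosomal DNA on either side of attP.
    print(integrate(["left flank"], ["right flank"], ["ura4"]))
```

The model is conservative by construction: every arm present before the exchange is present afterwards, with nothing gained or lost at the junctions.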

ΦC31 integrase does not excise integrated molecules

Thorpe and Smith ((1998) Proc. Natl. Acad. Sci. USA 95: 5505-5510) did not
detect reversal of the ΦC31 integrase reaction by analysis of gel-fractionated DNA
fragments. We examined the possibility of a reverse reaction through a genetic selection
strategy. The precise integration of pLT45 into FY527attP was confirmed for three clones
by Southern analysis; these strains were then re-transformed with pLT43. Excision of
pLT45 would result in loss of the ura4+ marker; the Ura- phenotype can be scored on plates
with 5-FOA (Grimm et al. (1988) Mol. Gen. Genet. 215: 81-86). The frequencies of Ura-
segregants from cultures of the three Ura+ His- progenitors were 5.7 x 10^-[?], 7.1 x 10^-[?] and 5.6
x 10^-[?]. In contrast, the frequencies of Ura- colonies from the three Ura+ His+ derivatives were
somewhat higher: 1.1 x 10^-[?], 3.8 x 10^-[?] and 2.3 x 10^-[?], respective 19-, 5- and 4-fold increases.
When a control vector lacking the integrase gene, pBG2, was used instead, increased rates
of 5-FOA resistance were also found: 1.0 x 10^-[?], 1.0 x 10^-[?], and 8.0 x 10^-[?], respectively. The
transformation process itself appears mutagenic.

Three Ura- His+ clones from each of the three cultures that had been
transformed by pLT45 were analyzed by Southern blotting. One isolate had a DNA pattern
consistent with stable integration of pLT45 into FY527attP. Therefore, in this clone, the
Ura- phenotype was caused by a mutation that did not appreciably alter the restriction
pattern, rather than by reversal of the site-specific recombination reaction. The second clone
showed a Southern pattern characteristic of FY527attP lacking a pLT45 insertion; the third
had a pattern consistent with a mixture of two types of cells, those like FY527attP without a
pLT45 insertion, and those like the FY527attP progenitor strain FY527. The latter structure
could arise from intrachromosomal homologous recombination between the leu1 repeats,
reversing the insertion of pLT44 (Figure 1A). If precise excision of the integrated plasmid
DNA occurred in the latter two candidates, the attP site would be regenerated; this would be
detectable with PCR. The size of the PCR product was that expected for an intact hybrid
site; the presence of the hybrid site was confirmed by sequencing the PCR product. These
observations are consistent with the idea that deletion of the ura4 gene occurred by some
mechanism other than ΦC31-mediated excision.

Summary

The integration of a circular molecule at a single target site was an efficient
process yielding precise insertions in nearly all transformants. The few aberrant events we
observed are probably largely attributable to the S. pombe recombination system acting on
the leu1 repetitive DNA. When integrase production was limited through the repression of
its promoter, the number of transformants was reduced to near background level. Under
these conditions, few of the recovered transformants were derived from ΦC31 site-specific
recombination. Functional operation of the ΦC31 site-specific recombination system in
eukaryotic cells presents new opportunities for the manipulation of transgenes and
chromosomes. The ΦC31 system can be used with selective placement of attB and attP sites
to delete, invert or insert DNA. An important feature of this system is that the attB x attP
reaction is irreversible in the absence of an excision-specific protein.

Example 2 

The ΦC31 Integrase Functions in CHO Cells to Create Stable Integration

This Example describes an experiment in which the ΦC31 integrase was
tested for ability to mediate recombination between attB and attP recombination sites in
Chinese hamster ovary (CHO) cells.

Methods

The CHO cell line 51YT211 was transfected with the attP-containing plasmid
pFY1, which included a selectable marker that confers zeocin resistance (Figure 2). After
being single colony purified twice, six zeocin resistant cell lines were isolated. Analysis by
Southern DNA hybridization confirmed that each of the six cell lines had at least one
molecule of pFY1 integrated into the genome.

Each of the six cell lines was transfected with the attB-containing plasmid
pFY9 and the int-containing plasmid pFY6 to test for site-specific recombination between
the attB sites on pFY9 and the attP site on the chromosomal copy of pFY1. As control, the
same cell lines were transfected with pFY9, but without the int-containing pFY6. The pFY9
plasmid included a neomycin resistance selectable marker under the control of an SV40
early promoter, as well as a green fluorescent protein (GFP) coding sequence that is not
linked to a promoter. Site-specific recombination would thus be expected to place the GFP
coding sequence under the control of a human cytomegalovirus promoter that was included
in pFY1, resulting in expression of GFP.

Results

Transfection results: Neomycin resistant colonies were placed under the
microscope to observe whether the GFP gene is active. A large percentage of the cells
transfected with pFY9+pFY6, but only a few of the cells transfected with pFY9 alone,
showed GFP activity. This is consistent with site-specific integration of pFY9 when co-
transfected with pFY6, and random insertion of pFY9 in the absence of a co-transfected int
gene.

PCR analysis was conducted using a primer set that corresponds to Pc
(human cytomegalovirus promoter) and GFP (Figure 2). These primers would be expected
to amplify a band of ~0.6 kb corresponding to the integration junction. Because neomycin
resistant colonies could arise from both site-specific integration and random integration, and
because the GFP marker does not confer a selectable trait, it was difficult to obtain pure cultures
of integrant clones. Therefore, pools of neomycin resistant cells from each transfected line
were subjected to PCR analysis to examine whether the integration junction was present among
the neomycin resistant cells. A band of the expected size of ~0.6 kb was obtained from two
lines. This indicates that the attB x attP recombination junction has formed, linking Pc with
GFP.

Example 3 

ΦC31 Integrase Catalyzes Site-Specific Recombination in CHO Cells

This Example describes a second experiment in which the ΦC31 integrase
was tested for ability to integrate a DNA molecule into the chromosome of Chinese hamster
ovary (CHO) cells through the recombination between attP and attB sites.

Methods 

Plasmid constructs 

Chromosomal attB target constructs pFY12, pFY14 and pFY15

The plasmid pcDNA3.1/His/lacZ (Invitrogen) was used as a vector backbone.
A synthetic oligonucleotide containing different lengths of the attB site, flanked by HindIII
and KpnI sites, was inserted between the HindIII (AAGCTT) and KpnI (GGTACC) sites of
pcDNA3.1/His/lacZ.

The plasmid pFY12 contains 90 bp of the attB sequence (AAGCTT
gacggtctcg aagccgcggt gcgggtgcca gggcgtgccc ttgggctccc cgggcgcgta ctccacctca cccatctggt
ccatcatgat GGTACC) (SEQ ID NO: 2).

The plasmid pFY14 contains 50 bp of the attB site (AAGCTT gcgggtgcca
gggcgtgccc ttgggctccc cgggcgcgta ctccacctca TGGTACC) (SEQ ID NO: 3).

The plasmid pFY15 contains 30 bp of attB (AAGCTT ccagggcgtg
cccttgggct ccccgggcgc ATGGTACC) (SEQ ID NO: 4).

Integrating attP plasmids pFY17, pFY19, pFY20 

The hpt gene encoding resistance to hygromycin, obtained as a 1.6 kb
BamHI to KpnI fragment from pED113, was inserted between the BamHI and KpnI sites of
pBluescript II SK to generate the control plasmid pBSK-hpt.

A synthetic oligonucleotide containing different lengths of the attP site was
inserted between SacI (GAGCTC) and BamHI (GGATCC) sites in pBSK-hpt to generate the
following plasmids:

a) The plasmid pFY17 contains 90 bp of attP site (GAGCTC-gaagcggttt tcgggagtag
tgccccaact ggggtaacct ttgagttctc tcagttgggg gcgtagggtc gccgacatga cacaaggggt-GGATCC)
(SEQ ID NO: 5).

b) The plasmid pFY19 contains 50 bp of attP site (GAGCTC-tgccccaact
ggggtaacct ttgagttctc tcagttgggg gcgtagggtc-GGATCC) (SEQ ID NO: 6).

c) The plasmid pFY20 contains 32 bp of attP site (GAGCTC-actggggtaa
cctttgagtt ctctcagttg ggATCC) (SEQ ID NO: 7).
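
The attB and attP fragments listed in this Example are progressive truncations of one another, which can be checked mechanically. The sketch below is only a consistency check on the printed sequences: the sequences are copied from the text with the flanking restriction sites removed, and the helper names (`clean`, `nesting_offsets`) are illustrative.

```python
def clean(seq):
    """Drop the grouping spaces used in the printed sequences."""
    return "".join(seq.split())


# att site portions only; the flanking restriction sites are omitted.
ATT_B = {
    "pFY12": clean("gacggtctcg aagccgcggt gcgggtgcca gggcgtgccc ttgggctccc "
                   "cgggcgcgta ctccacctca cccatctggt ccatcatgat"),
    "pFY14": clean("gcgggtgcca gggcgtgccc ttgggctccc cgggcgcgta ctccacctca"),
    "pFY15": clean("ccagggcgtg cccttgggct ccccgggcgc"),
}
ATT_P = {
    "pFY17": clean("gaagcggttt tcgggagtag tgccccaact ggggtaacct ttgagttctc "
                   "tcagttgggg gcgtagggtc gccgacatga cacaaggggt"),
    "pFY19": clean("tgccccaact ggggtaacct ttgagttctc tcagttgggg gcgtagggtc"),
    "pFY20": clean("actggggtaa cctttgagtt ctctcagttg gg"),
}


def nesting_offsets(sites):
    """Map each shorter site to (next longer site, 0-based offset within it).

    An offset of -1 would mean the shorter sequence is not contained in the
    longer one.
    """
    names = sorted(sites, key=lambda name: len(sites[name]), reverse=True)
    return {
        shorter: (longer, sites[longer].find(sites[shorter]))
        for longer, shorter in zip(names, names[1:])
    }


if __name__ == "__main__":
    for label, sites in (("attB", ATT_B), ("attP", ATT_P)):
        print(label, {name: len(seq) for name, seq in sites.items()})
        print(label, nesting_offsets(sites))
```

Each shorter fragment lies inside the next longer one, so the constructs differ only in how much flanking att sequence they retain around a shared core.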

Integrase expressing construct pFY6 

An EcoRI to BamHI fragment containing the nearly complete open reading
frame of the integrase gene was inserted between the EcoRI and BamHI sites of
pcDNA3.1/Zeo(-) (Invitrogen). A synthetic oligonucleotide (GGGCCCGCCACGATGACA
CAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCG
CGCGAGCGCGAGAATTC) (SEQ ID NO: 8) containing a Kozak sequence and the N-
terminal amino acid coding sequences of the integrase gene was subsequently inserted
between the ApaI and EcoRI sites to reconstruct the open reading frame. This orientation
places a complete integrase coding region under the control of the CMV (human
cytomegalovirus) promoter in pcDNA3.1/Zeo(-).
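
The structure of the synthetic oligonucleotide can be inspected with a few lines of code. This is a hedged sketch rather than part of the described method: it assumes the standard genetic code, uses the ApaI (GGGCCC) and EcoRI (GAATTC) recognition sequences, and the function name `translate_from_first_atg` is illustrative.

```python
# Synthetic oligonucleotide (SEQ ID NO: 8), copied from the text.
OLIGO = (
    "GGGCCCGCCACGATGACA"
    "CAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCG"
    "CGCGAGCGCGAGAATTC"
)

APAI_SITE = "GGGCCC"
ECORI_SITE = "GAATTC"

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from_first_atg(dna):
    """Translate complete codons from the first ATG to the end of the sequence."""
    start = dna.find("ATG")
    if start < 0:
        raise ValueError("no ATG found")
    codons = [dna[i:i + 3] for i in range(start, len(dna) - 2, 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    print("starts with ApaI site:", OLIGO.startswith(APAI_SITE))
    print("ends with EcoRI site:", OLIGO.endswith(ECORI_SITE))
    print("first ATG at index:", OLIGO.find("ATG"))
    print("encoded N-terminus:", translate_from_first_atg(OLIGO))
```

The ATG sits immediately downstream of the GCCACG leader, and the reading frame runs uninterrupted into the EcoRI site used to join the oligonucleotide to the rest of the coding region.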

Transfection protocol 

The CHO cell line K-1 was transfected with attB target constructs pFY12,
pFY14 or pFY15 (Figure 3). These plasmids harbor the selectable marker for neomycin
resistance, and an attB site of various lengths located between Pc (human cytomegalovirus
promoter) and the lacZ coding region. Plasmids pFY12, pFY14 and pFY15 contain,
respectively, 90, 50 and 30 bp of the attB sequence. Neomycin-resistant cell lines were
obtained from consecutive purification of single colonies. Four lines of each construct were
used for integration experiments.

Each of the 12 lines was transfected with pFY6, a ΦC31 integrase expression
plasmid, along with an integration vector, pFY17, pFY19, or pFY20. The plasmids pFY17,
pFY19 and pFY20 harbor an attP sequence of lengths 90, 50 and 32 bp, respectively. The
attP sequence is situated upstream of the hpt open reading frame, which encodes
hygromycin phosphotransferase, an enzyme that confers resistance to hygromycin. There is
no promoter upstream of the attP-hpt segment and hpt is therefore not expressed unless the
plasmid integrates into the genome in such a way that the hpt coding region fuses with a
genomic promoter. For control, pBSK-hpt was used to monitor the frequency of promoter
fusion to hpt. The plasmid pBSK-hpt is identical to pFY17, pFY19, and pFY20 except it
lacks an attP sequence. The recombination between attP and attB sites is expected to insert
the integration vector into the chromosome target to generate a Pc-attL-hpt linkage.
Expression of hpt will confer resistance to hygromycin.



Results

Transfection results: Hygromycin resistant colonies were scored for each
integration plasmid which was transfected into the 12 cell lines (Table 2). From 1 x 10^[?] cells
plated, pBSK-hpt transfections failed to produce a significant number of resistant colonies.
This indicates that the frequency of the hpt coding region fusing to a genomic promoter is
extremely low. In contrast, pFY17, pFY19 and pFY20 yielded up to a thousand fold higher
number of hygromycin resistant colonies, depending on the particular integration plasmid
and the particular cell line. Higher numbers of hygromycin resistant colonies were produced
from the transfection of pFY19 or pFY17 into FY12 lines. This indicates that the
recombination between longer attB and attP sequences is more efficient than the
recombination between shorter attB and attP sites.

PCR was used to detect the expected ~0.8 kb junction band from
representative colonies. Primers corresponding to the human cytomegalovirus promoter and
the hpt coding region amplified a PCR product of the expected size (0.8 kb). This indicates
that Pc is linked to the hpt coding region, consistent with recombination between the
genomic attB site and the plasmid attP sequence.

Table 2: Number of hygromycin resistant colonies per 1 x 10^[?] transfected cells.

Target cell lines    Integration plasmids (of the column labels, only pFY20 survives)
FY12-1               856
FY12-2
FY12-3
FY14-3               23    240    45    0
FY14-7               96    245    67    0
FY14-8               21    49
FY14-9               89
FY15-1               24    0
FY15-2               0     345    34    0
FY15-3               55    455    23    1
FY15-4               0     0      0     0

[Rows with fewer than four values are incomplete; the surviving values are listed in
their original order and their column assignment is not recoverable.]

Example 4

The ΦC31 Integrase Functions in Plant Chromosomes to Recombine attP and attB Sites

This example describes an experiment in which the ΦC31 integrase was
tested for ability to recombine attP and attB sites that are present in a plant chromosome.
The constructs and strategy for this experiment are shown in Figure 4.



Methods 

The construct pWP29 contains the fragment consisting of 35S-attP-npt-attB-
gus, flanked by RB and LB, where 35S is the cauliflower mosaic virus promoter, npt is the
coding region for neomycin phosphotransferase, and gus is the coding region for
glucuronidase. RB and LB are the right and left Agrobacterium T-DNA border sequences,
respectively. The attP site between 35S and npt serves as a non-translated leader sequence.
Transcription of npt by 35S confers resistance to kanamycin. The gus coding region is not
transcribed due to the lack of an upstream promoter.

A second construct used for plant transformation is pWP24. This construct
contains the fragment Pnos-npt-35S-int, flanked by RB and LB, where Pnos is the nopaline
synthase promoter, and int is the ΦC31 integrase coding region. Both npt and int are
transcribed from their respective upstream promoters.

If the two constructs were present in the same genome, the expression of int
from the pWP24-bearing chromosome would be expected to produce functional ΦC31
integrase to catalyze the recombination between attB and attP sites situated on the pWP29-
bearing chromosome. The recombination event would be expected to delete the npt gene
from the pWP29 construct and fuse 35S to gus. The resulting configuration would be 35S-
attR-gus, where attR is a hybrid site formed by the recombination between attP and attB,
also designated as PB' (Figure 4). The deletion of npt brings gus under the transcription of
35S and would be expected to yield plants with GUS enzyme activity. This activity can be
detected through histochemical staining of the plant tissue.
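
The expected deletion can be illustrated by treating the pWP29 insert as an ordered list of elements. This is a minimal sketch: the function name `delete_between` is hypothetical, and the statement that the excised circle carries the reciprocal attL site is a standard inference about this type of reaction that the text above does not spell out.

```python
def delete_between(construct, site_a="attP", site_b="attB"):
    """Resolve two directly oriented att sites on one linear construct.

    Returns (remaining, excised): the construct with the intervening segment
    replaced by a single hybrid site, and the contents of the excised circle.
    """
    i = construct.index(site_a)
    j = construct.index(site_b)
    if i > j:
        raise ValueError("expected site_a upstream of site_b")

    # attP (P-P') upstream of attB (B-B') leaves P-B' (attR) on the chromosome.
    remaining = construct[:i] + ["attR"] + construct[j + 1:]
    # Assumption: the excised circle carries the reciprocal B-P' (attL) site.
    excised = ["attL"] + construct[i + 1:j]
    return remaining, excised


PWP29 = ["35S", "attP", "npt", "attB", "gus"]
REMAINING, EXCISED = delete_between(PWP29)

if __name__ == "__main__":
    print("-".join(REMAINING))
    print("excised circle:", "-".join(EXCISED))
```

After the deletion the promoter is separated from gus only by the hybrid site, which is why GUS staining serves as a readout of recombination.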

Results

A transient expression assay was conducted to determine whether pWP29
was functional for recombination. Through the biolistics-mediated delivery of naked DNA,
pWP29 was cointroduced with pWP8 into maize BMS cells. The construct pWP8 has the
integrase gene fused behind the maize ubiquitin promoter for expression in monocot cells.
Blue spots were observed when both plasmids were co-introduced, but were not found if
only one of the plasmids was used. This indicated that site-specific recombination took
place in maize cells and that the attP and attB sites in pWP29 were functional sites.

Kanamycin resistant tobacco plants were regenerated by Agrobacterium-
mediated transformation using pWP29 or pWP24. Another transient expression assay was
conducted to determine whether the pWP24 lines produced functional integrase. The
construct pWP29 was introduced into the pWP24 plants through biolistics-mediated delivery
of naked DNA. Cells that take up the pWP29 DNA would be expected to express GUS
enzyme activity as a result of the formation of a 35S-attR-gus configuration. Indeed, two
lines, 24.3 and 24.4, yielded blue spots consistent with functional integrase-mediated site-
specific recombination between the attP and attB sites.

These two pWP24 integrase lines were crossed to pWP29 tester lines to
produce progeny with the chromosomes carrying pWP29 and pWP24 in the same genome.
Table 3 summarizes the results from the genetic crosses between integrase (24.3, 24.4) and
tester lines (29.2, 29.4, 29.5, 29.19). In each case, representative progeny seedlings were
germinated in the absence of selection and histochemically stained for GUS enzyme activity.
The table lists the number of progeny that stained blue. As the primary transformed pWP24
and pWP29 lines are hemizygous for their respective transgene, only a quarter of the
progeny would be expected to carry both transgene types. The sample sizes were small, so
an apparent deviation from the expected frequency is not unusual.

Table 3: Progeny that showed gus expression from histochemical staining.

Male donor    Female recipient    Number of progeny           Number of progeny that    % positive for
plant line    plant line          stained for gus activity    show gus activity         gus activity
24.3+         29.2                38                          11                        29%
29.2          24.3+               38                          1                         2.6%
29.4          24.3+               18                          3                         16%
24.3+         29.5                38                          4                         10%
29.5          24.3+               26                          0                         0
24.4          29.2                38                          7                         19%
29.2          24.4+               38                          7                         19%
29.4          24.4+               19                          8                         42%
24.4+         29.5                38                          17                        45%
29.5          24.4+               20                          6                         30%
29.19         24.4+               18                          7                         39%
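
The one-quarter expectation for crosses between two hemizygous parents can be compared with the counts in Table 3. The sketch below assumes each transgene segregates as a single locus, that the two loci assort independently, and that a seedling stains blue only if it carries both; the function names are illustrative, and the binomial tail is included only as one simple way to gauge how unusual a low count would be under those assumptions.

```python
from math import comb

# (male line, female line, progeny stained, progeny GUS-positive), from Table 3.
CROSSES = [
    ("24.3", "29.2", 38, 11), ("29.2", "24.3", 38, 1), ("29.4", "24.3", 18, 3),
    ("24.3", "29.5", 38, 4), ("29.5", "24.3", 26, 0), ("24.4", "29.2", 38, 7),
    ("29.2", "24.4", 38, 7), ("29.4", "24.4", 19, 8), ("24.4", "29.5", 38, 17),
    ("29.5", "24.4", 20, 6), ("29.19", "24.4", 18, 7),
]

# Each hemizygous parent passes its transgene to half of its gametes.
P_BOTH = 0.5 * 0.5


def prob_at_most(k, n, p=P_BOTH):
    """Binomial probability of k or fewer double-transgene progeny among n."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def summarize(crosses=CROSSES):
    """Return (male, female, n, observed, expected count, observed fraction)."""
    return [(m, f, n, k, n * P_BOTH, k / n) for m, f, n, k in crosses]


if __name__ == "__main__":
    for male, female, n, k, expected, fraction in summarize():
        print(f"{male} x {female}: observed {k}/{n} ({fraction:.1%}), "
              f"expected {expected:.1f}, P(<= observed) = {prob_at_most(k, n):.3f}")
```

A count below expectation may also reflect incomplete recombination or weak staining in seedlings that do carry both transgenes, so the binomial figure should be read as a rough guide only.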



The intensity of staining varied depending on the combination of lines used as
parental lines. Progeny with a greater proportion of the tissue staining blue
indicate that the recombination event was more efficient. Conversely, progeny
with less uniform staining indicate that the recombination event was less efficient. This
variation among the different progeny pools is probably due to effects caused by the position
of integration of the transgenes. Of the two integrase lines, 24.4 appears more efficient in
promoting site-specific recombination. This is probably due to a higher level of int gene
expression. Staining patterns produced by crossing 24.4 to 29.4 and 29.19 are consistent with
the experimental design, in which int-promoted site-specific recombination of attB and attP
results in the activation of gus gene activity.

Example 5 

ΦC31 Integrase Catalyzes Integration of a Circular Plasmid into a Plant Chromosome

This example describes an experiment in which the ΦC31 integrase was
tested for ability to insert a circular plasmid molecule into the plant chromosome through
attP x attB site-specific recombination. This experiment is diagrammed in Figure 5.

Methods 

The target construct pWP6 contains the fragment consisting of 35S-attP-npt,
flanked by RB and LB. The attP site between 35S and npt serves as a non-translated leader
sequence. Transcription of npt by 35S confers resistance to kanamycin.

The integrating construct pYJC43 has the fragment attB-hpt, where hpt codes
for resistance to hygromycin. The integrase expression construct is pYJC41, in which 35S
transcribes int.

The target construct pWP6 was placed into a plant chromosome through
random integration of pWP6 DNA. Kanamycin resistant plants harboring a single copy of
the pWP6 transgene were then transformed with pYJC43 and pYJC41. The
transient expression of int from pYJC41 was expected to catalyze the recombination
between the attB site of pYJC43 and the chromosomally-situated attP site of the pWP6
transgene. The specific recombination between attB and attP sites would insert the pYJC43
circular molecule into the chromosome to generate a 35S-attL-hpt linkage. Note that
because the attP and attB sites are depicted in the inverted orientation, the attL site will
likewise be in an inverted orientation, designated P'B, which is the same as BP' drawn in an
inverted orientation. A functional 35S-attL-hpt linkage would confer a hygromycin
resistance phenotype.

Results 

Kanamycin resistant tobacco plants harboring pWP6 were obtained through
Agrobacterium-mediated transformation. Southern hybridization analysis detected one line
that harbors a single copy of the pWP6 transgene. Progeny from this line, WP6.1, were
germinated aseptically and protoplasts were made from these plants. The protoplasts were
transformed by the combination of pYJC43 and pYJC41 DNA by the polyethylene glycol
method for direct DNA transformation. The protoplasts were then embedded into agarose
and cultured to form calli in the presence of hygromycin. The rate of callus formation in the
absence of hygromycin selection was 4 x 10^-[?]. This is about 10 fold lower than usual, but is
within the range of variability observed in protoplast transformation experiments. In the
presence of hygromycin selection, the rate of callus formation was 7 x 10^-[?]. This indicates
that about 18% of the calli that regenerated from protoplasts contained the integration vector
at the target site. When the integrase construct pYJC41 was excluded from the
transformation, the rate of callus formation was <1 x 10^-[?]. The higher frequency of
hygromycin resistant calli produced by inclusion of the integrase expressing plasmid
pYJC41 is consistent with the integrase-promoted site-specific integration of pYJC43 into
the chromosomal attP target.



15 It is understood that the examples and embodimaits described herein are for 

illustrative purposes only and that various modifications or changes in light th^eof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
^pUcations cited herein are hereby incorporated by reference for all purposes. 
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1 1 . A eukaryotic cell that comprises a prokaryotic recombinase polypeptide 

2 or a nucleic acid that encodes a prokaryotic recombinase, wherein the recombinase can 

3 mediate site-specific recombination between a first recombination site and a second 

4 recombination site that can serve as a substrate for recombination with the first 

5 recombination site, but in the absence of an additional factor that is not present in the 

6 eukaiyotic cell caimot mediate recombination between two hybrid recombinase 

7 recombmation sites that are fonned upon recombination between the first recombination site 

8 and the second recombination site. 



1 2, The eukaryotic cell of claim 1 , wherein the recombinase is selected 

2 fix)m the group consisting of a bacteriophage OCS 1 integrase, a coliphage P4 recombmase, a 

3 Listeria phage recombmase, a bacteriophage R4 Sre recombinase, a CisA recombinase, an 

4 XisF recombinase, and a transposon ln4451 TnpX recombmase. 

1 3. The eukaryotic cell of claim 1 , wherein the recombinase is a 

2 bacteriophage OCSl integrase. 

1 4. The eukaryotic cell of claim 1 , wherein the first recombination site is an 

2 attB site and the second recombination site is an attP site. 

1 5. The eukaryotic cell of claim 1 , wherein the cell fiirther comprises a first 

2 recombinase recombination site. 

1 6. The eukaryotic cell of claim 1 , wherein the cell comprises a nucleic acid 

2 that comprises a coding sequence for an recombinase polypeptide, which coding sequence is 

3 operably linked to a promoter that mediates expression of the recombinase-encoding 

4 polynucleotide m the eukaryotic cell. 

1 7. The eukaryotic cell of claim 6, wherein the nucleic acid fiirther 

2 comprises a selectable marker. 
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1 8. The enkaryotic cell of claim 6, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 9. The eukaryotic cell of claim 8, wherein the nucleic acid is the plasmid 

2 pLT43. 

1 1 0. The eukaryotic cell of claim 1 , wherein the eukaryotic cell is selected 

2 from the group consisting of an animal cell, a plant cell, a yeast cell, an insect cell and a 

3 fungal cell. 

1 11. The eukaryotic cell of claim 1 0, wherein the eukaryotic cell is a 

2 mammalian cell. 

1 12. The eukaryotic cell of claim 10, wherein the eukaryotic cell is present in 

2 a multicellular organism, 

13. A method for obtaining site-specific recombination in a eukaryotic cell, the method comprising:
    providing a eukaryotic cell that comprises a first recombination site and a second recombination site, which second recombination site can serve as a substrate for recombination with the first recombination site;
    contacting the first and the second recombination sites with a prokaryotic recombinase polypeptide, resulting in recombination between the recombination sites, thereby forming one or two hybrid recombination sites;
    wherein the recombinase polypeptide can mediate site-specific recombination between the first and second recombination sites, but cannot mediate recombination between two hybrid recombination sites in the absence of an additional factor that is not present in the eukaryotic cell.

14. The method of claim 13, wherein the eukaryotic cell is selected from the group consisting of a yeast cell, a fungal cell, a plant cell, an insect cell and an animal cell.

15. The method of claim 13, wherein the first recombination site is present in a chromosome of the eukaryotic cell.

16. The method of claim 15, wherein the second recombination site is present in a second chromosome of the eukaryotic cell and contacting the first and second recombination sites with the recombinase results in translocation of chromosome arms.

17. The method of claim 13, wherein the first recombination site and the second recombination site are present on a single nucleic acid molecule.

18. The method of claim 17, wherein the first recombination site and the second recombination site are in a direct orientation.

19. The method of claim 18, wherein the recombination results in excision of the portion of the nucleic acid molecule that lies between the first and second recombination sites.

20. The method of claim 17, wherein the first recombination site and the second recombination site are in an inverted orientation.

21. The method of claim 20, wherein the recombination results in inversion of the portion of the nucleic acid molecule that lies between the first and second recombination sites.

22. The method of claim 13, wherein the eukaryotic cell comprises a polynucleotide that encodes the recombinase polypeptide.

23. The method of claim 22, wherein the recombinase-encoding polynucleotide is operably linked to a promoter which mediates expression of the polynucleotide in the eukaryotic cell.

24. The method of claim 23, wherein the promoter is an inducible or a repressible promoter.

25. The method of claim 24, wherein the promoter is a Pmnt promoter.

26. A method for obtaining a eukaryotic cell having a stably integrated transgene, the method comprising:
    introducing a nucleic acid into a eukaryotic cell that comprises a first recombination site, wherein the nucleic acid comprises a transgene and a second recombination site which can serve as a substrate for recombination with the first recombination site; and
    contacting the first and the second recombination sites with a prokaryotic recombinase polypeptide, wherein the recombinase polypeptide catalyzes recombination between the first and second recombination sites, resulting in integration of the nucleic acid at the first recombination site, thereby forming a hybrid recombination site at each end of the nucleic acid;
    wherein the recombinase polypeptide can mediate site-specific recombination between the first and second recombination sites, but cannot mediate recombination between two hybrid recombination sites in the absence of an additional factor that is not present in the eukaryotic cell.

27. The method of claim 26, wherein the recombinase polypeptide is selected from the group consisting of a bacteriophage ΦC31 integrase, a coliphage P4 recombinase, a Listeria phage recombinase, a bacteriophage R4 Sre recombinase, a CisA recombinase, an XisF recombinase, and a transposon Tn4451 TnpX recombinase.

28. The method of claim 27, wherein the recombinase is a ΦC31 integrase.

29. The method of claim 26, wherein the recombinase polypeptide is introduced into the eukaryotic cell by expression of a polynucleotide that encodes the recombinase polypeptide.

30. The method of claim 29, wherein the polynucleotide that encodes the recombinase polypeptide is operably linked to a promoter that functions in the eukaryotic cell.

31. The method of claim 30, wherein the promoter is an inducible or a repressible promoter.

32. A nucleic acid that comprises a polynucleotide sequence that encodes a bacterial recombinase polypeptide operably linked to a promoter that functions in a eukaryotic cell, wherein the recombinase polypeptide cannot mediate recombination between two hybrid recombination sites that are formed upon recombination between a first recombination site and a second recombination site in the absence of an additional factor.



33. The nucleic acid of claim 32, wherein the nucleic acid further comprises at least one recombination site that is recognized by the recombinase polypeptide.

34. The nucleic acid of claim 32, wherein the nucleic acid comprises a plasmid vector.

35. The nucleic acid of claim 34, wherein the vector is pLT43.

36. A eukaryotic cell that comprises a polynucleotide that comprises a first bacteriophage ΦC31 recombination site.

37. The eukaryotic cell of claim 36, wherein the recombination site is selected from the group consisting of attP and attB.

38. The eukaryotic cell of claim 36, wherein the eukaryotic cell further comprises a second polynucleotide that comprises a second ΦC31 recombination site that undergoes recombination with the first ΦC31 recombination site when contacted with a ΦC31 integrase polypeptide.

39. The eukaryotic cell of claim 38, wherein:
    the first recombination site is attB and the second recombination site is attP; or
    the first recombination site is attP and the second recombination site is attB.

40. The eukaryotic cell of claim 38, wherein the second polynucleotide further comprises a transgene.

41. The eukaryotic cell of claim 38, wherein the second polynucleotide further comprises a selectable marker.

42. The eukaryotic cell of claim 36, wherein the eukaryotic cell further comprises a ΦC31 integrase polypeptide.

43. The eukaryotic cell of claim 36, wherein the eukaryotic cell further comprises a nucleic acid that comprises a polynucleotide that encodes a ΦC31 integrase polypeptide.

44. The eukaryotic cell of claim 43, wherein the nucleic acid further comprises a selectable marker.

45. The eukaryotic cell of claim 43, wherein the nucleic acid further comprises a promoter which results in expression of the ΦC31 integrase-encoding polynucleotide in the cell.

46. The eukaryotic cell of claim 45, wherein the promoter is an inducible promoter.

47. The eukaryotic cell of claim 36, wherein the eukaryotic cell is selected from the group consisting of a yeast cell, a fungal cell, a plant cell, and an animal cell.

Figure 2. Transgene integration in a CHO cell line: site-specific expression of GFP.

Figure 3. Transgene integration in a CHO cell line: hygromycin resistance from attB x attP recombination.

Figure 4. Excision of DNA from the tobacco genome.

Figure 5. Integration of DNA into the tobacco genome.